http://www.hpl.hp.com/conferences/icml2003/titlesAndAuthors.html ICML 2003 http://www.hpl.hp.com/conferences/icml2003/papers/212.pdf 99 DISTILL: Learning Domain-Specific Planners by Example aaai abstraction academic acknowledgements acquiring action advanced afrl agency agreement aips algorithm allow alto analogical analyzing approach approximation arbitrarily arti authors automated automatic automatically automating based bauer berlin boston carbonell cation cial cimatti city computing concept conclusions conditional conference contained control current darpa decision defense demonstration design development dissertation doctoral document domains dsplanner dsplanners effective effects either endorsements environments examples executing experimentation explanation expressed fikes force forests france friedland functional generalized generation germany goal hart hendler hierarchy ijcai implementation implied inductive inside intelligence international interpreted italy iwasaki joint journal kambhampati kaufmann khardon kids kluwer knowledge kodratoff laboratory large learned learning linked load logic looping lowry lumberjack machine mccartney method michalski milan minton modi morgan muggleton national necessarily nement nilsson nondeterministic number obdd observed ones operator palo part plan planning plans policies press problems proceedings programming programs projects publishers reactive reasoning references reformulation representing represents research reuse robot robots rocket rocketdomain roveri sacerdoti sara scheduling schmid schoppers search seattle should sixth skeletal small smith software solutions solve spaces sponsored springer state strategies symposium synthesis system table technische tenth that then theory this those toulouse traverso under universal universitat university unload unpredictable used uther validationstructure veloso verlag views volume wards washington while will winner with http://www.hpl.hp.com/conferences/icml2003/papers/76.pdf 25 An Analysis of Rule Evaluation Metrics algorithm analysis application area arti australia between bias bradley bristol cation cestnik cial clark classi cohen computer conference conquer crucial cstr curve curves decision department discovery distance ecai effective environments estimating european evaluation expert fast fawcett ferri flach from furnkranz gamberger geometry guided hernandez heuristics icml imprecise induction inductive intelligence international isometrics journal kaufmann lake lavrac learning letters logic logical machine measures methodology metrics morgan niblett nitions oblinger pattern pitman probabilities proceedings programming provost quanti quinlan recognition references relations report research review robust rule science separate space springer stanford stockholm subgroup sweden sydney tahoe task technical test trees under understand unifying university using verlag view vilalta workshop zupan http://www.hpl.hp.com/conferences/icml2003/papers/374.pdf 68 Using Linear-threshold Algorithms to Combine Multi-class Sub-experts advice algorithms allwein annual approach arti association bakiri berlin bianchi binary block blum brain calendar cesa cial classi codes computational computing conference correcting cortes crammer dietterich domain empirical error expert freund functioning haussler helmbold intelligence international journal kaufmann learning machine machinery majority margin model modern morgan multiclass networks online output perceptron physics problems proceeding reducing references research results reviews schapire scheduling singer solving springer support theory twelfth ultraconservative unifying vapnik vector warmuth weighted winnow http://www.hpl.hp.com/conferences/icml2003/papers/275.pdf 90 Weighted Low-Rank Approximations advances algorithm algorithms analysis anandan another appear application approaches approximation arti azar bayesian bilinear billsus binary bounded cases cial cient circuits class collaborative collins complex component computation computer computing conclusion conference conjugate constant content convergence convex dasgupta data decomposition descent design developed digital dimensional eigentaste estimation european exponential extremely factor factorization family fiat freeman functions gaussians general generalization generalized global goldberg gordon gradient gupta have hessian ieee implement important information intel international irani jaakkola jordan karlin learning least ligence likelihood linear logistic loss ltering lters machine matrices matrix maximum mcsherry method methods midwest minimum mixtureof model models more neural ninth noise other paper parameter pazzani perkins principal problem problems proceedings processing provided provides psychometrika rank references respect retrieval right roeder saia saul schapire schein separating shpak similar simple sixth solve solving some spectral squares statistics style subroutine subroutines such symposium systems tenenbaum that their theory third thirty this time transactions tting uncertainty ungar used using usually variational vision wang weighted well will with works workshop young http://www.hpl.hp.com/conferences/icml2003/papers/282.pdf 109 Learning Decision Tree Classifiers from Attribute Value Taxonomies and Partially Specified Data aaai abstract abstraction adamson advances aggregate aggregation akiba almuallim ambiguous annual approach approximation artificial association attribute attributes background based bayesian chen cheung classification clustering cohen computational concept conference data database databases decision demichiel desjardins dhar discovery distributional domains driven duda efficient engineering english evaluating exploration fayyad feature features first from getoor handling hart hendler hierarchies high honavar hwang ieee imprecise incompatibility induction information intelligence intelligent international issues kaneda kaufmann knowledge koller language learning lecture level levels linguistics machine management mateo mcclean meeting merz mining mismatched morgan multiple national natural network notes ontologybased ontologydriven operations oriented over pattern pazzani pereira performing power press probabilistic proceedings programs quinlan references reformulation relational research resolving rule rules scotney setvalued shapcott sigmod silvescu springer stoffel stork structured symposium systems taylor third thirteenth thirty tishby transactions transcactions tree trees tseng tuzhilin twelfth uncertain using verlag wiley with words workshop yamazaki york zhang http://www.hpl.hp.com/conferences/icml2003/papers/169.pdf 2 Online Choice of Active Learning Algorithms achieved active advice algo algorithm algorithms angluin appears applications auer average banana bandit benchmark benchmarks best bianchi boldface breast campbell cation center cesa ceza christianini ciency classi cohn comb combines committee computing concept conclusions conf cristianini data dataset diabetis each ectively ensemble equation error estimation expert flare freund future generalization german haussler heart helmbold heuristic hochbaum http icml ieee image information jmlr journal koller large learners learning less machine machines margin marked mathematics mccallum members more multiarmed nonstochastic online operations optimal possible presented problem queries query raetsch ratsch reduction references research ringnorm runner sampling schapire schohn selective self sets seung shamir shaw shmoys siam simple smola soft solar splice star support table taylor text that theory through thyroid tishby titanic tong toward transactions twonorm using vector warmuth waveform winner with work http://www.hpl.hp.com/conferences/icml2003/papers/314.pdf 82 Q-Decomposition for Reinforcement Learning Agents aaai abstraction achieving actions advances agent agents algorithms andre annual application approach approximation architecture arti asynchronous athena autonomous bacchus based behavior belmont berkeley bertsekas bled board boutilier brafman brooks building california cambridge canada cial claus collaborative computing conditional conference connectionist control cooperative coordinated cued dayan decisiontheoretic decomposition delayed department dietterich directed dissertation doctoral dynamic dynamics eighteenth eleventh engineering england expected factored feudal fifteenth finfeng fisheries from function functions goldberg graphical grove guestrin harada hierarchical hierarchies hinton ieee information integrating intelligence international invariance inverse joint journal kaufmann koller konolige laboratory lagoudakis learning levandowski line logic machine machines mateo maximizing maxq mdps memo models morgan multiagent multiple multivalued national networked networks neural neuro nineteenth niranjan objective ogasawara opaquetransition optimal otti parr partitioned pirjanian planning policies policy preference press proceedings processing programmable programming ralph real recruitment references reinforcement report representation research reward rewards ricker robot robotics robots rosenblatt rummery ruspini russell scaling scienti seattle selection seventeenth shaping sixteenth slovenia song spatial state stochastic stock stockholm stone structured systems team technical teleoperation tenth theory through time transformations tsitsiklis uncertain uncertainty under university using utilities utility value veloso voting watkins with http://www.hpl.hp.com/conferences/icml2003/papers/355.pdf 37 Transductive Learning via Spectral Graph Partitioning about abstracted additional advances aistats algebra algorithm algorithms applications approximation areas arises avoid based belkin bennett between beyond bipartite blum bounds capacity case cation chapelle chawla ciently classi classify cluster clustering colt combining conclusions concurrent conference consider constrained cotraining cuments cuts data degree dependent derive dhillon ding does duced eigenvalue elds elkopf enco estimating evaluated examples exibility existing focs from functions furthermore gander golub graph greedy gross hagen heuristics icml ieee image improvements individual inductive inference interesting intro jaakkola ject joachims kahng kernels kleinberg kluwer knowledge labeled labeling large learning linear machine machines malik manifolds markov mathematical matt mccallum meila metho method methods metric mincut mitchell modeled modeling more most need nigam nips niyogi normalized opens original other outperforming outperforms over pairwise pami partially particular partitioning possible press principles prior problem problems programming random ratio recognition references relationship relationships require research robust sample scho scholkopf search seen segmentation semi sets settings showed showing since small special spectral splits statistical studied substantial supervised support szummer tardos tasks temporal test text than that their theory this thrun tight trained training transactions transductive tsvm unbalanced unlabeled unlike using value values vapnik variety vector version view walks weston which wiley with words http://www.hpl.hp.com/conferences/icml2003/papers/65.pdf 105 Optimizing Classifier Performance via an Approximation to the Wilcoxon-Mann-Whitney Statistic ability accuracy achieved acknowledgments acoustics across advanced advances advantageous against algorithm algorithms also analysis analytics annealing appear application applied appropriate approximation area associated baluja bartell based bayesian behavior belew biometrics both breuel canada caruana case cases cation class classi clustering colagrosso comparing comparisons computer conclusion conf constrained cottrell cross curve customer data decision demonstrate demonstrated department detection diego directions directly discriminative dissatisfaction distribution document dodier earlier economic empirical entropy environments erentiable error estimation evaluation extend extended fawcett feedback figure finally focus function functions future gain generalisation generalization goal green grimes group guerra hand have here ieee imbalanced improvement improvements improves improving individual induction industry information interesting intl intractable irvine jective john johnson joint kaushansky kohavi large larger layer learning left lower machine magnitude malignancy mann manuscript math maximize mean medical methods miller minimizing mitchell moreau most mozer multi multiple multitask network networks neural nips nonstationary objective obtained optimization optimize optimizing other ovarian over parameters part patterns perceptron performance predict predicting prediction present presented probabilistic problems proc process processing prodding proposed provost psychophysics query random range ranked ranking rankings rankprop rattenbury real references related relevance reliable report requires resulting results retention retrieval risk rutgers salcedo scale science secondly sets several signal similar similarity simple simulated small some sons sort speci speech squared stage statist statistic still stochastically study subscriber subsequent substantial swets symposium system systems technical technology telecommunications test than that theory this thresholds till timmerman train training trans tumors under university unpublished using vandewalle variables variations verrelst version vogt weiss when whether whistler whitney wilcoxon wiley wireless with wolniewicz work workshop world would york http://www.hpl.hp.com/conferences/icml2003/papers/161.pdf 8 The Use of the Ambiguity Decomp osition in Neural Network Ensemble Learning Metho ds aaai abbass academy accurate active advances algorithm analyzing anticorrelation applications approaches arti australia australian bagged bagging bartlett baxter best bias boosting breiman cambridge canberra carney chapter cial classi college combining conf conference connection correlated correlation cross cunningham decimation decomposition decorrelated decorrelation defence dimensionality disser diverse diversity doctoral domingos dublin ensemble ensembles error estimators evolutionary experiments expert feature force francisco frean freund functional generalization generating gradient hypotheses information input intel international issue journal kaufmann krogh large learning ligence ligent lncs machine margin mason mckay members morgan nakano national negative negatively network networks neural nips opitz otago predictors press proc proceedings processing produce reduction references report rosen schapire science selection shavlik south special systems tation technical techniques through trinity tumer tuning ueda university using validation variance vedelsby wales with zealand http://www.hpl.hp.com/conferences/icml2003/papers/335.pdf 47 A Kernel Between Sets of Vectors advances annual arti between bhattacharyya calcutta california cambridge cial classi collins computer conference convex convolution cortes cruz department dirac discrete discriminative distributions divergence expected exploiting generative haussler information intel international invariance jaakkola jebara kernel kernels kondor language learning ligence likelihood math measure mechanics models mohri natural neural ninth oxford populations press principles probability proceedings processing quantum rational references report santa science seventh sixteenth statistical statistics structures systems technical their theory ucsc university workshop http://www.hpl.hp.com/conferences/icml2003/papers/209.pdf 84 Optimization with EM and Expectation-Conjugate-Gradient acceleration algorithm algorithms american analysis association atkinson augmentation censoring chris computation computing conjugate cruz data della dempster effects elds estimates fast features fraction fraley from fruitfly function gene genie gradient http hybrid ieee implementations incomplete inducing information intelligence jamshidian jennrich john journal lafferty laird largest lbnl likelihood linear machine march maximum meng methods missing mixed mixture model models mortaza newton normal pattern performance pietra quasi random references report robert royal rubin santa sequence series simulation society standard stat statistical stephen technical transactions type university using vincent washington with worst http://www.hpl.hp.com/conferences/icml2003/papers/44.pdf 9 Tractable Bayesian Learning of Tree Augmented Naive Bayes Mo dels aaai able accuracy accurate algorithm allow analysis anytime applying approximated approximation aproximatting around arti attributes augmented averaging based bayes bayesian beforehand behavior being belief bene best blake calculate calculation cant carnegie cation cations cerquides characteristics chickering chow cial classi colorado combination comes compare concentration conclusions conditioned conference continue core cost csic cubic data databases dataset datasets decomposable department dependence determinants determine development discovery discrete distribution distributions doing domingos ecml empirical erences erent european finally flairs formalizing friedman from future geiger general goldszmidt good have heckerman here hoeting http ieee iiia implementation improve improvement increase increasing incremental indi induction information instance instances institute intel international introduced jaakkola keogh know knowledge kontkanen langley large leads learned learning ledge library ligence linear long lopez loss lower machine madigan make making mantaras measure meila mellon merz middle mining model most myllymaki naive national need network networks number observations obvious only optimal optimality over pazzani perform performed performing pittsburgh posterior press price probability probably proceedings process processed proposed provide raftery randomly references relative remains report reported reportiiiatr repository reviewed robotics section seen shoup signi silander similar simple size some sort springerverlag sstbmatan stage state statistical statistically statistics stop suitable table tbmatan technical techniques tenth tests that then theory these think this thompson time tirri tractable transactions tree trees under university variance very volinsky whether will with work zero http://www.hpl.hp.com/conferences/icml2003/papers/110.pdf 80 Kernel PLS-SVC for Linear and Nonlinear Classification academic adaboost alcohol algorithm analysis approach barker barr bartlett based beyond bibby bookstein brain build canonical case cation centralized chemometric chemometrics cient clanton class classi collinearity component components computation computationally computations compute continuum correlation data department development developments dimensional discriminant discrimination dunn econometrics ection ects eigenproblem eigenvalue elements emphasis endix engineering extract extraction fact fisher fmri friedman from fyfe gani generalized gram hastie hibbs hilbert honour human ieee intel interface international inverse iterative jection joint jong jorgensen journal kent kernel kernels kres krupka laboratory latent learning least lewi ligent linear lkopf london machine machines mapping mardia margins matrix matthews methods mika modeling more muller multi multimodal multivariate networks neural neurobehavioral neuroelectric neurotoxicology nipals nonlinear number onoda optimization output outputs overview papers part partial pattern perspectives point power prenatal press probability problem processing production provides rayens recognition references regression regularization rehabilitation report reproducing research ricker ridge rosipal ruhe same samples sampson scienti score seattle siam signal simpls skuldsson smola soft solution solved some space springer squares statistical statistics steps streissguth support survey systems technical test tetralogy than that then theory this three thus tibshirani transactions trejo tsch twoblock uncorrelated university used vapnik variables variant vector vectors view vinod washington wegelin weston wheeler wiley wise with wold worsley york http://www.hpl.hp.com/conferences/icml2003/papers/319.pdf 115 Online Convex Programming and Generalized Infinitesimal Gradient Ascent adaptive algorithm algorithms amari analog annual anomolies applications approachability architecture auxilary available bansal behavior bianchi blackwell blum boot boser bounds boyd bregman calibration cambridge cameron carnegie case cesa chawla ciently classi computation computational conference convex cvxbook decision della descent distances duality economic erty fifteenth fifth foster freund functions game games gradient guyon html http ieee introduction learning lelism line linear long margin mathematics mcnally mellon meyerson minimax multiplicative natural networks neural oblivious online optimal optimization paci paral payo pietra playing prediction press problem proceedings programming proof quadratic rand references regret report routing schapire south stanford symposium technical theorem theory training transactions university using vandenberghe vapnik vector vohra warmuth weights works worst http://www.hpl.hp.com/conferences/icml2003/papers/284.pdf 71 Machine Learning using Hyp erkernels adaboost advances airconditioning albert algorithm algorithms alignment american applied approximation auxiliary bartlett bayesian bennett boosting bottom bound bounds bousquet boyd cambridge center cient classi common competition complement complexity computation computer conditions conf conference considered constraint constraints convolutional corresponding correspondingly cortes crammer cristianini cruz data department dependent design digit dimensional discrete discrimination distribution dual elissee energy engineers equality equivalent erentiating estimating experiments figure fine forthcoming francisco free freund from function gaussian ghaoui given gives haussler heating herrmann high hornik http hyperkernels images information inseparable inserting international into introducing inverse ject jective jmlr jordan journal kandola kaufmann kernel kernels keshet lagrange lagrangian lanckriet large learing learning leave leisch lemma linear linearly lkopf machine machines mackay mangasarian margin margins mathematics matrix maximized mean method methods meyer minimax minimize minimizing mining model modelling momma moore morgan muller multipliers musicant networks neural neurocomputing nite niteness nonnegative note novel obtain obtained obtains onoda opper optimization osition pattern penrose platt positive prediction press problem problems proc proceedings processes processing programming prop properties proposition prove pseudoinverses publishers quadratic rank references refrigerating regression report representation research respect result review rewrite robust santa schapire scheinberg schur science search second selection semide serves sets setting shawe siam since singer smola society soft software solution some sparse structures subsequently such support systems target taylor technical term terms test that their then training transcations tsch ucsc under upper using vandenberghe vapnik variable variables vector watson where which williamson winther with wolfe written york zero zhang http://www.hpl.hp.com/conferences/icml2003/papers/231.pdf 30 Correlated-Q Learning adaptive agent agents algorithm algorithms allow annals annual applications approach august aumann based behavior behaviors biology call cally central combining compatible computational concept conclusion conference contained converge convergence converges coop coordination correlated correlation could ctitious decentralized decision design designers discussed draft dynamics economic economics eighteenth eleventh equilibria equilibrium equipping erative ercal erimental evolutionary extreme facilitate fifteenth forcement foster foundation framework freund friend game games gametheoretic general generalized generate given goal gondek greater greenwald have improve incentives international jafari jectivity journal july june known lead learn learning line linear littman machine markov mathematical mathematics minimax miscoordination model morgenstern multiagent nash neumann novemb olicies ongoing only oosting other pages paper past perhaps planner planners play policies population prediction present prespecify press princeton problem procedure proceedings programming qvalues randomized rarely rational references regret reina reinforcement rely repeated replaces replacing research results rewards schapire serve seventeenth similarly simultaneously solution speci specify stochastic strategies studied studying such susceptible system systems szep than that their theoretic theoretical theory these thirteenth this types university values version vohra wellman which with work working would young http://www.hpl.hp.com/conferences/icml2003/papers/184.pdf 66 Identifying Predictive Structures in Relational Data Using Multiple Instance Learning ability acknowledgments actual adaptations advances advantage afrl also although amar ambiguity amherst anonymous another appear application applies approach arti aspects attributes authorized authors axisparallel bags based been best better blau both bunke california cambridge cannot cation chevaleyre cial classi clique comments common computer concept concepts conference contain content contracts copyright could current darpa data database databases decision density department designed developed dietterich different discovery discussions dissertation distance distribute diverse doctoral dooly each effort emphasized evaluation existing express feature features form framework francisco friedland fritts from function goldman government governmental graph graphs grateful grossing hand hannah have help helpful hereon high however human identi identify image immerman important improved inference information instance instead institute intelligence intelligent international irrelevant iteratively jennifer jensen kaufmann kiri knowledge label language large lathrop learner learning letters links lozano machine maron massachusetts mateo maximal maximizes mcgovern methods metric mining modify more morgan movies multiple multipleinstance mutagenesis natural networks neural neville ninth notation notwithstanding number object objects only outgoing paper paris part pattern pearl perez plausible predicting predictive presented press probabilistic probability problem problems proc processing publishers purposes querying ratan real reasoning recognition rectangles references referred related relational relevant report represent represents reprints reproduce respect retrieval reviewers rpts rules scale scaling scene science section shearer sigkdd simpler small solving squared stated strength strengths structure structures subgraph subset successfully such supported systems task tasks technical technique techniques technology that their these they this tree trees under understand understanding understood university updating used using value valued vector version very visual wagstaff well when which will with work zero zhang zucker http://www.hpl.hp.com/conferences/icml2003/papers/302.pdf 67 Planning in the Presence of Cost Functions Controlled by an Adversary aamas ability accounted acting advances adversary algorithm algorithms allows also annual approximately bagnell bazaraa benders bottleneck brown calls cant carnegie cassandra cient column competitive computational conclusions conference cost costs could cplex criteria decision described detail developed dijkstra discrete domains double dynamic each element entities environments erence even example experiment experiments explained explicit faster fewer figure filar finally formulation full future gained gordon have hope http ilog implementation implicit important incremental index indoor information insigni instead institute intend interesting interscience investigate jarvis john kaelbling kernels koenig large learning length likhachev linear littman locating markov master mathematik mellon memory methods missing mixed mobile models more most moving multiplicative need network neural number numerische observable observations online operations optimization oracle oracles outperforms partial partitioning path performance pittsburgh plan planning pomdp practical preference problem problems procedures proceedings processes processing products programming pronounced puterman references report represent representation requiring research reward robotics robots rosencrantz schneider sherali shown shows signi since single small software solution solving sons springer stochastic storing strategies support systems table take takimoto teams technical tested that theory there this thrun time total uenced uncertain university updates uses variable vector vectors version vrieze warmuth weighted when wiley willey with without work would http://www.hpl.hp.com/conferences/icml2003/papers/303.pdf 28 Hierarchical Policy Gradient Algorithms abstraction accelerate achieved acquisition action actor advantages algorithms also applying arti autonomous average bartlett based baxter behavior berkeley between california case children cial class combines conclusions conference continuing continuous continuoustime control critic decision decomposition demonstrated depends describes dietterich dissertation doctoral domain doya ectiveness eighteenth episodic erent estimation fifteenth formulate formulated framework fuel function future ghavamzadeh gradient hierarchical hierarchically hierarchy higher horizon hybrid include intel international investigated involves journal konda learned learning level ligence lower machine mahadevan marbach markov maxq mdps method methods morimoto nineteenth nite notions optimal optimality other overall paper parr performance pgrl policies policy policygradient precup previous problem problems proceedings processes proposed real reasons references reinforcement reported research results reward robot robotics root semi ship simple simulation since singh space spaces stand state steering subtasks sutton systems task taxi temporal that them these this type university using vfrl which with work http://www.hpl.hp.com/conferences/icml2003/papers/345.pdf 52 The Pre-Image Problem in Kernel Metho ds adding advances analysis applied based biometrika burges chapman clustering conference decision diagrams edition feature francisco girolami gower hall ieee international kaufmann kernel learning machine mercer mika monographs morgan muller multidimensional multivariate networks neural noising point probability proceedings ratsch references rules scaling scholkopf scholz second simpli smola space spaces statistics support thirteenth transactions vector http://www.hpl.hp.com/conferences/icml2003/papers/301.pdf 26 Margin Distribution and Learning Algorithms advances algorithm algorithmic analysis anthony approach arriaga bartlett based baxter bayes boosting bounds california cambridge computer concepts conference context correction databases distribution effectiveness explanation explicit extensions figure foundations freund garg generalization golding hilbert http improved information international into irvine john johnson kaufmann langford learning lindenstrauss lipschitz machine mappings margin margins mason methods mlearn modern morgan murphy network neural optimization peled probability proc processing projection random references report repository robust roth schapire science sensitive shawe sketch sons space spelling statistical systems taylor technical theoretical theory through university vapnik vempala voting wiley winnow york http://www.hpl.hp.com/conferences/icml2003/papers/230.pdf 114 Eliminating Class Noise in Large Datasets aaai ablex accuracy accurate acknowledgements acquisition adopted advances after algorithms american anonymous applied approach approaches army artificial authors bain behaves belgium berlin blake boswell breiman brodley brooks certain chan chen clark class classification cleaning columbia comments comparative comparison conclusions conf constructive corp count criterion daad data database databases dataset datasets decision denies detection different disadvantage discovering discovery distinguishing distributed domain domains dzeroski each earlier ecml effective efficient eliminating elimination error evaluate evaluations examples exception exceptions existing experimental experiments extension extensive fact filter filtering fire first francisco friedl friedman from gamberger given good grant groselj guyon handling have higher http icml identification identified identify identifying improvement improving induced induction inductive inform information inherent instance instances intellig intelligence international into john journal kaufmann knowledge kolluri laboratory large lavrac learning less like likely machine mateo matic matrices medical merz meta methods mining mislabeled monotonic more morgan muggleton national netherlands noise noisy number office olshen outliers paper partition partitioning patterns portion preprocessing presented press probability problems proc procedure programs proposed provost pulishing quinlan receiving recent record references regression removed removing repeated report repository research reviewers robust round rule rules same satisfied scalable scaling schemes science selected sets shown shtml society some springer srinivasan stone stopping studies subset subsets supported survey technical thank that their thesis they this threshold training trees under university until used usually values vapnik variables verbaeten verlag version wadsworth with workshop would http://www.hpl.hp.com/conferences/icml2003/papers/79.pdf 36 A Faster Iterative Scaling Algorithm For Conditional Exponential Model aberger above abramowitz according adaptive agichtein agonizing algorithm algorithms always annals appendix approach assuming assumption because beeferman berger bfgs borthwick bound cambridge claims classification comparison computational computer conference conjugate conll constraint corpora darroch diverse dover empirical england entity entropy equation estimation expanded exploiting extension fast filtering following formulas functions further generalized gentle gradient graphs grishman handbook have hold holder holds html http identical ijcai improved induction inequality information introduction iqll item iterative jelinek knowledge lafferty language large learning letting likelihood limited linear linguistics logistic logreg london long machine malouf massachusetts math mathematical maximum mccallum memory merging method methods minka model modeling models moller named natural need network neural nigam nocedal optimization pain papers parameter part pietra positive press printing proceedings processing prog proof prove proved ratcli ratnaparkhi recognition references regression report rosenfeld satisfies scale scaled scaling second segmentation shewchuk simply sixth sources speech stat statistical statistics stegun step sterling supervised tables tagging tech text that therefore this together true upper using very when will with without workshop york http://www.hpl.hp.com/conferences/icml2003/papers/149.pdf 20 Action Elimination and Stopping Conditions for Reinforcement Learning action additional aiello algorithm algorithms almost american anal annual appl apply applying approximate approximated approximately approximation architecture argument arrive associated association assume assumption assumptions athena barto base based basis behaved belmont bertsekas borkar bound bounded bounds cambridge claim clearly competitive completes computation computing concept conclude conference consider constant contraction control convergence converges criteria decision delayed details ding direction directions dissertation doctoral duction dynamic each elimination endix enough estimaltes estimate estimation estimations even every example exbe exists expression extending extension failure finite follow following follows fourteenth fourth from function functions future general have high history holds horizons howard implies important indirect induction inductive inequalities inequality infocom information international interscience iteration iterations iterative jagopolan journal kakade kaufmann kearns kernel kesselman krmax langford large last learning lemma loss lotker lower machine macqueen management mansour mapping markov math maxa method meyn model modi morgan most need neural neurodynamic next nineteenth nite nition nity noise nonnegative norm note number obtain optim optimal order ormoneit osition other over pair pairs patt pectation performed pirical policies press probability probably problems proceedings processes processing programming proof prop proposition prove pseudo puterman queue random rates references reinforcement rentiated repeating requirement requirements requires restricted result rewards rmaa rmax rosen same sample satis schieber scienti second sequence services shamir show shown siam similar since singh size some spaces start state statistical step stochastic such sums suppose sure sutton sviridenko switches symposium systems take term that them then theory there therefore this three thus time trivially tsitsiklis type under union university upper used using value variable variables vector verges vmax watkins well were where which wiley will with zero http://www.hpl.hp.com/conferences/icml2003/papers/10.pdf 41 Representational Issues in Meta-Learning aamodt advances algorithms altho applied approaches armengol avila base based bergmann brazdil breslow carlos case cohen communications conference conversational costa descriptions ective european experience fast foundational induction intel international issues ject kaufman knowledge lazy learning ligence machine management measures methodological morgan muller munoz nick online oriented plaza proceedings ranking reasoning references relational representations rule similarity snoek springer stahl system technology variations workshop http://www.hpl.hp.com/conferences/icml2003/papers/145.pdf 104 Decision-tree Induction from Time-series Data Based on a Standard-example Split Test aaai about akiba algorithm almuallim analyze arti attributes based behaviour belmont berka berlin breiman cation chal challenge chapman cial cient classi conference data decision discovery download drucker ecml ecmlpkdd extraction finding friedman gain geurts goes hall hepatitis hierarchical http induction intel kaneda know learning ledge lenge ligence lisp lnai menlo mining multiple national olshen opponent optimal park pattern pkdd players press principles proceedings ratio references regression robocup robot series soccer split springer stone tests thirteenth time tree trees using verlag world http://www.hpl.hp.com/conferences/icml2003/papers/291.pdf 103 Cross-Entropy Directed Emb edding of Network Data advances algorithm alon although analysis another approach based become been before behavior bell between browsing buja cambridge changes characteristics circle classical clustering cmds coli combine communications complex complicated computational conference congressus connectivity continuous cross cult data date dean develop dimensional dimensionality direct direction directions directly discovery dotted drawing dynamic dzeroski each eades embedded embedding empirically encouraged energy entropy escherichia evaluate existing expected experiments extend figure floyd framework francisco from function future general genetics geometric global graph graphical guide hackers hand have heuristic highly hinton hofmann however html http important incorporated incorporating inducing inductive information inherent instance interactive interface international jordan journal kamada kaufmann kawai laboratories langford langley large layouts learning left letters linear links littman locally machine mangan matlab measure method methods might milo model modeler models more morgan motifs multidimensional must nature neato need neighbor network networks neural newly nineteenth nips nodes nonlinear north number numerantium observed obtained obtaining optimization other over paper papers path picture planing preserving press principle proceedings process processes processing produced promising proposed quite reasonably reduction references regard regularities regulation relational remain research results reveal right roweis sanchez saul scale scaling science scienti seems shared shen shortest shown shows silva smoothness spectral spring statistics step still stochastic structures such support swayne system systems tenenbaum terms than that them theory there these this thus time todorovski tool torgerson toronto transcriptional understand undirected useful user using variables variety visualization vols weiss well which whole wider wiley with works would xgvis york http://www.hpl.hp.com/conferences/icml2003/papers/280.pdf 64 The Cross Entropy metho d for Fast Policy Search acknowledgements actor adaptive advantage algorithms also anderson annals annealing anonymous another appear applied approximation arti asynchronous athena bartlett barto based baxter bertsekas boer borkar cambridge cant cantly cial cient combinatorial computing conclusion conf conference conjugate connectionist considered continuous control convergence created critic cross cult currently cybernetics decision delayed direct dissertation distribution doctoral dynamic easily elements entropy environment error estimate estimation event expected experimentation experiments exploited exploration exponential extension facilitate following framework from future good gradient gradients grateful guaranteeing guided haifa hierarchical homem horizon ieee importance improving includes incorporating initial intel international israel joint journal kaelbling kaufmann kearns knowledge konda kroese learning ligence littman local machine mannor manuscript many markov mechanisms mello method methodology methods meyn models moore more morgan multiagent near need neuro neuronlike nite number operation optim optimal optimization other parameters plenty policies policy polynomial presented press probability problem problems proc proceedings processes programming proximity puterman random rare references reinforcement relation required research reversing reviewers rewards robot robust rosenstein rubinstein sampling scienti search sensitive seventeenth shelf siam signi simple simulated since singh small solve speed static statistical steepest stochastic structure study submitted such survey sutton systems technion than that theoretical there three time transactions tsitsiklis tutorial tweaked university unknown used using watkins weaver weightlifting when which wileyinterscience williams with http://www.hpl.hp.com/conferences/icml2003/papers/173.pdf 3 Learning Logic Programs for Layout Analysis Correction aaai about acquisition action adaptive altamura analysis approach artificial atre automated automatic based berlin between blockeel classification clustering components computer conference constraint control cussens data databases decision declarative dengel dependencies discovery document documents domains down dubiel eighth ekdb eleventh epia esposito european examples extraction first format foundations fourth francisco frisch from hacid huang ieee image images imbalanced incremental induction inductive initial intelligence intelligent international interpretation into invited ismis issues john joint journal kaufmann kautz khardon kise knowledge kodratoff layout learning lecture lisi logic logical lorenzo machine malerba morgan normal notes order otero paper pattern planning portuguese press proceedings programming programs provost raedt recognition recursive references rules second selection selman semeraro sets setting sixth society solutions sons spatial springer srihari strategies structure structures symposium systems theories third thirteenth transforming trees understanding vaqueros walischewski wiley wisdom with workshop zack zighed http://www.hpl.hp.com/conferences/icml2003/papers/296.pdf 48 Visual Learning by Evolutionary Feature Synthesis aaai algorithms application artificial automatic banzhaf based bhanu blackboard brodley coevolution coevolutionary comparison computation computer conference construction corporation danyluk draper eighteenth evolution evolutionary evolving features francisco francone genetic hanson http hypotheses image institute intel intelligence international introduction johnson journal kaufmann keller krawiec learning library luke machine manual massachusetts master morgan nordin pairwise pattern plus pohoreckyj press proceedings processing programming programs projects recognition reference references representation riseman routines scheduling system technology thesis transformation vision visual workshop york http://www.hpl.hp.com/conferences/icml2003/papers/113.pdf 55 The Influence of Reward on the Speed of Reinforcement Learning: An Analysis of Shaping abramowitz accelerate accelerated accumulated accurate action actions after agents algorithm algorithms alstr analysis application applied applying approximately average averages based because behaviors berkeley bicycle bled cambridge case central characterizes cohen colombetti computer conclusion conference correspond curve curves data dejong delay delayed dependency developing difference different difficulty distance dorigo dover drive during each edition eleventh encouraging episode episodes every except experiments explanation explore fairly feedback fifteen fifteenth fifth figure first five formulated four from function functions given hall handbook harada have hirsh hogg horizon immediate indicating inference institute intended international interval intervals invariance issues jersey kaufmann kearns last laud learn learning least machine made mataric mathematical meets more morgan must near nineteenth number observe optimal over parameter parameters performance policy polynomial potential practical prentice probability problem proceedings publications randl receives reduced reducing references reinforcement report reward rewards robot russell science shape shaping shavlik showed shows signal simple singh situated sixteenth size slovenia specifically statistical stegun strategy strong structure successful successively tanis technical temporal tesauro that theory thesis this through thus time transformations under university until upon using value varied verifies very walked watkins which will with without york http://www.hpl.hp.com/conferences/icml2003/papers/237.pdf 23 Random Pro jection for High Dimensional Data Clustering: A Cluster Ensemble Approach achlioptas agrawal american analysis applications association automatic bingham cluster clustering combining conference council criteria data database dimensional dimensionality discovery ensembles evaluation framework friendly gehrke ghosh gunopulos high ieee image indexing international jective joint journal kaski know knowledge latent learning ledge machine management mannila mapping methods mining multiple networks neural papadimitriou partitions press principles probabilistic proceedings projection projections raghavan rand random reduction references research reuse semantic seventeenth seventh sigkdd sigmod statistical strehl subspace symposium systems tamaki text twentieth vempala http://www.hpl.hp.com/conferences/icml2003/papers/317.pdf 11 BL-WoLF: A Framework For Loss-Bounded Learnability In Zero-Sum Games about acapulco adaptive adversarial advice agent algorithm algorithms annals annual appeared approximation arti auer bandit banos bayes bayesian behavior bianchi bowling brafman canada casino cautious certain cesa cial cient classes commerce complexity conference conitzer consistency contributions control convergence ctitious decision decisions diego dover dynamic dynamics earlier econometrica economic economics eighteenth electronic equilibria equilibrium expert extensive focs forecasts form foster framework freund fudenberg gambling game games general gradient hannan haussler helmbold ijcai incomplete information intelligence international internet jehiel john joint journal kalai kearns leads learning lehrer levine littman luce machine making mansour markov mathematical megiddo mexico monderer multi multiagent multiplicative nash near neural nips operations optimal papadimitriou play played players playing polynomial press princeton problem proceedings processing pseudo raiffa randomization rate rational references reinforcement repeated report republication research results rigged risk rule samet sandholm schapire selecting singh sons stanford statistics stoc stochastic stone systems technical tennenholtz theoretical theory time uncertainty university using valuation vancouver variable veloso version vohra warmuth weights wellman wiley with york http://www.hpl.hp.com/conferences/icml2003/papers/111.pdf 18 Using the Triangle Inequality to Accelerate -Means aaai abstract accelerating acceleration acoustics adversary aggarwal alamos alenex algorithm algorithms alsabti also alternatives amir analysis anchors annals annual applications approaches approximate approximately argument arti average based basic behavior best better birch bound burkhard calculations certain cheng choose cial cient circuits classi clustering clusterings colt communications computational computations computer computing condition conference constant construct continuous convergence cosines curse dasgupta data database databases dempster deng dilation dimensional dimensionality discovery distance distances distortion distributed efrat eleventh elkan encoding engineering environments exact experiments explorations faber farnstrom fast faster fifteenth fifth foundations fourteenth fourth francisco from full gasull generaltree geometric geometry gersho gray guarantees hamerly hierarchical hierarchy high hinneburg hodgson icassp icdt ieee image implementation improvement incomplete incorrect indyk inequality information instance intelligence international ipps issues ized jain january joint journal judd kanungo kaufmann keim keller kick knowledge laird large learning least letters lewis likelihood livny lloyd location london lower machine macqueen make management many marques match matching mathematical maximum mckinley mclean mcnames means method metrics mielikainen minimum mining monge monte montolio moore morgan motwani mount multiresolution nding nearest needed neighbor neighbors neighbour neighbours netanyahu novel omits only open opponent optimization orchard oregon other output paliwal parallel partial partitions pattern pelleg performance phillips piatko portland press problem problems proceedings processing projection proximity quantization question ramakrishnan ramamurthi ramasubramanian random ranka reasoning recognition reducing references region regular related remote removing requirements revisited rotated royal rubin samet sanfeliu satisfy scalability scale science scienti search searching second sensing shoham showing sigkdd sigmod signal silverman simple singh sixteenth society some spaces spdp speech springer squares statistical statistics structures surprising survive symposium systems technology tencon that then theoretical theory these time together torres towards transactions triangle twelfth uncertainty using values variance vector verlag very vidal video virginia whether with workshop world zhang http://www.hpl.hp.com/conferences/icml2003/papers/122.pdf 56 Learning with Positive and Unlabeled Examples Using Weighted Logistic Regression aaai agrawal algorithm algorithmic appear athena available based bayardo blum brary bylander cation chang cjlin classi classify colt combining computational computer conf conference crosoft csie data database databases denis dimensional discovery distribution documents estimating example examples extending foundations frieze from functions gilleron high ieee interactive international kannan knowledge labeled learning libsvm linear machine machines madison management mccallum menlo mihttp mining mitchell muggleton nigam noise noisy page park pebl platt polynomial positive presence press proc proceedings queries references report research scholkopf science shawe smola software srikant statistical support symposium taylor technical technology text theory threshold thrun time tommasi training unlabeled using vector vempala williamson with workshop http://www.hpl.hp.com/conferences/icml2003/papers/15.pdf 61 An Evaluation on Feature Selection for Text Clustering advances aggrawal algorithm algorithms american analysis approach artificial association automated based bekkerman bengio blum bottou browsing categorization classification cluster clustering clusters collections convergence cutting dash data dempster dimensional distributional document evidence examples exploratory feature features finding friedman from galavotti gather generalized high incomplete information intelligence intelligent international journal karger laird langley large learning likelihood machine maximum means negative neural pakdd pedersen proc processing projected projection properties pursuit references relevant royal rubin scatter sebastiani selection sigir sigmod simi society spaces stat systems text tishby tukey winter yaniv http://www.hpl.hp.com/conferences/icml2003/papers/247.pdf 44 Characteristics of Long-term Learning in Soar and its Application to the Utility Problem aaai allen altmann altos application approach artificial based blake bostrom california cambridge carbonell carnegie chunking chunks cognition composer computer concerning conference congdon control crandrasekaran databases dejong department discovery dissertation doctoral doorenbos effect effective eleventh eliminating engineering executing expensive expert explanation expressiveness fifth fikes forgetting general generalized george goal gratch group hart harvard heuristic hierarchies holder html http ieee information intelligence international interviewed irvine joint jose june kaufmann kennedy keogh knowledge korf laird large learned learning like long machine macro manual markovitch mason matching mateo mellon menlo merz method michalski minton mitchell mlearn mlrepository model mooney morgan national newell nilsson ninth operators park plans practice press probabilistic problem proceedings production quantitative redundancy references repository restricting results robot role rosenbloom rule rules school science scott search seventh soar solution speed systems tambe technology tenth term theories there unified univ university users utility version weak what workshop http://www.hpl.hp.com/conferences/icml2003/papers/128.pdf 97 Learning Mixture Models with the Latent Maximum Entropy Principle achievable ackley actually adaptive adds algorithm analysis annals annealing another appears association automatically because behaves best boltzmann case chooses cogn comments comput conclusion conditional considered consistency consistent constraint converge convergence converges counteract covariance cover cross darroch data degenerate della demonstrate dempster determinant determining deterministic differential dirichlet distribution effect either elds elements eliminating entropy error estimating estimation estimator eugenics even experiments fact features finite fisher form from future gaussian generalized goes graphical guaranteed have hinton however icml ieee important incomplete increased inducing information intelligence interested iterative jaynes john just known labeling lafferty laird large latent lauritzen learning like likelihood limit linear logic lter machine machines main manuscript mathematical matrix maximum mccallum mclachlan measurements method minimum minka missing mixture model models more multiple nakano networks neural nite nonzero observation order papers pattern peel penalization penalty pereira physics pietra practical preferred principle probabilistic probability problems proceedings programming properties proved random ratchliff rather real reduce references regularization regularizer reidel remains representable requires research resulting riezler rosenkrantz royal rubin same sample samples scaling schuurmans segmenting sejnowski sense sequence show size sizes smaller smoother solution solves sons standard statis statist statistical statistically statistics stronger stuttgart such taxonomic term than that theory thesis thomas though topic transactions tting ueda unbounded under underlying university wang well when where wiley will with zero zhao http://www.hpl.hp.com/conferences/icml2003/papers/226.pdf 24 The Geometry of ROC Space: Understanding Machine Learning Metrics through ROC Isometrics ability accuracy achine acknowledged acknowledgements algorithms also alternative analyse analysis anonymous another application area argued artificial assessment australia avenue belmont between bias boosting both breiman built butterworths central characteristic class classification comments commonly compared completion computing concluding conference conjecture consider considers construction contingency cost criteria curve decision defining demonstrated derived deriving detection discovery discussions distance doddington down drummond during each effective enlightening environments eurospeech evaluation expert explicit exploiting express fawcett fellow ferri finally find finished flach former forward foundation foundations friedman from further gain gamberger generalisation gini gradient gratefully gratitude guided hand have help helpful hernandez heuristics holte icml ijcai important imprecise include including induction inductive information initial insensitive instance intelligence international into isometric isometrics johannes joint journal kamm kaufmann kearns landscape latter lavrac learning like logic london machine made mansour many martin measure measures methodology metric metrics model more morgan most multi multiple national number oblinger obtain obtained obvious olshen ordowski orthogonal paper performance perhaps play plots point points press problems proceedings producing product products programming proposed provides provost przybocki purpose quantification quinlan ratio ratios references regression remarks research results retrieval reviewers rijsbergen rnkranz robust role rule search sensitive sensitivity served simple simplification skew skewsensitive slope south space split splitting springer stone subgroup support symposium tables taken task than thanks that their theory there this through till tree trees turned under unifying university used using variety verlag version view vilalta visiting wadsworth wales ways weakly well where which while with work workshop would zupan http://www.hpl.hp.com/conferences/icml2003/papers/195.pdf 108 Learning Metrics via Discriminant Kernels and Multidimensional Scaling: Toward Exp ected Euclidean Representation advances alignment analysis bach borg bousquet california cambridge cation cients classi complexity computer convolution cristianini cruz department discrete discriminant dissimilarities elissee euclidean ferry gower graphical groenen haussler herrmann information jordan journal kandola kernel kernels learning legendre matrix mercer metric models modern multidimensional neural pattern press processing properties recognition references report santa scaling science shawetaylor springer structures systems target technical ucsc university using verlag with york http://www.hpl.hp.com/conferences/icml2003/papers/144.pdf 106 Feature Selection for High-Dimensional Data: A Fast Correlation-Based Filter Solution aaai academic accuracy algorithm analysis archive arrhythmia arti attributes average based berlin blake blum boston cambridge cancer catania categorization chang chemical cial class coil communication comparative computer conference conssf correlation corrsf current customer dash data databases dept dimensional directions discovery discrete discretization dissertation doctoral each eighteenth enabling estimating european examples exponentially extension fcbf feature features fifteenth flannery fourteenth frank full genomic hall high html http huang hussain ieee image implementations intel intelligence international irrelevant isolet issues italy java john joint jordan journal karp kaufmann kira kluwer know kohavi kononenko langley learning ledge ligence lung machine many menlo merz methods microarray mining mitra mlearn mlrepository morgan motoda multi murthy musk national nineteenth numeric numerical open park pattern pederson pracitcal press problem proceedings programs promising promoters publishers quinlan recipes references relevance relevant relief relieff rendell repository representation retention retrieval review sampling science selected selection selective seventeenth similarity splice springer study subset symposium table technique techniques tenth teukolsky text title tools traditional training transactions university unsupervised uscensus using verlag vetterling visual waikato with witten wrappers xing yang http://www.hpl.hp.com/conferences/icml2003/papers/281.pdf 74 Online Feature Selection using Grafting amount blake boser california carried change column comparing computer databases dataset datasets dept each ercentage eriments error excessive exhaustive features figure graft grafting graphs greedy gures guyon html increases information irvine learned learning linear machine measures merz method misclassification mlearn mlrepository model number references relates repository required results science seconds seen show step this three time total train university vapnik various versions weights were wise with http://www.hpl.hp.com/conferences/icml2003/papers/148.pdf 101 Adaptive Feature-Space Conformal Transformation for Imbalanced-Data Learning amari annotation approach aspects bagging based bayes bowyer breiman burges chan chang chawla circuits class classi cohn computer conceptual conference conformal content contentbased cost data description distributed distributions dover dynamical effects functions geometry hall ieee image improving international invariance issue kegelmeyer kernel knowledge learning machine mapping methods mining minority modifying multi multimedia multimodal networks neural notes over point predictors press pubns references retrieval riemann sampling smote soft special stolfo support surfaces sychay synthetic systems technique technology transactions uniform using vector video with workshop http://www.hpl.hp.com/conferences/icml2003/papers/36.pdf 65 The Set Covering Machine with Data-Dependent Half-Spaces achieves acknowledgments advances algorithm algorithms alternative applies arti association asymmetrical balls better bias bottou bound california called cambridge case chervonenkis chvatal cial classi communications comparable compared compression computational computer computers computing conclusion conference covering cross cruz data datadependent datapoints decision department dependent dimension effective effectiveness eighteenth error errors expected extending factor favorable features floyd fold framework freeman frequently furthermore garey generalization generalize generally good grant greedy guide half haussler have hence heuristic hinton holloway however icml important indicates inductive information intelligence international intractability introduced introduction investigate johnson journal kearns kermit learnability learnable learning littlestone london loss machine machinery makes manual marchand massachusetts mathematics model natural neural next note npcompleteness nserc number occurs often only operations outlook pairs part performing practice press problem proceedings processing produces project provide quantifying reasearch reference references relating report research revow royal sample santa saunders schoelkopf science scms seen selecting selection sets shawe shown small smola some sometimes spaces sparser splits step stitson substantially support supported svms symmetrical systems taylor technical technique terms that theory training trees under university using valiant validation vapnik vazirani vector warmuth weston whenever which with work york http://www.hpl.hp.com/conferences/icml2003/papers/278.pdf 77 Tackling the Po or Assumptions of Naive Bayes Text Classifiers analysis application available bayesian berger beyond categorization cation chakrabarti church city class classi codes coding conditions confusion content correcting distribution documentation domingos duda ecml engineering error features from gale ghani godbole hart heckerman http icml ijcai independence interclass interpretation joachims jones journal katz language learning machines many microsoft mixtures modelling multi natural networks optimality output pattern pazzani phrases poisson proceedings references relevant report research retrieval sarawagi scaling scene sigkdd simple sons speci statistical support svmfu technical term text tutorial using vector wiley with words http://www.hpl.hp.com/conferences/icml2003/papers/222.pdf 45 Unsupervised Learning with Permuted Data advances algorithm algorithms among analysis analyzed application applications applied astronomy automating bayes bound brief cadez california cation classi classify closed clustering concluded conclusions construction correlation correspondence data datalab derived duda edition empirical erent error estimation example expressions extensions feature form found framework frey from galaxies galaxy general gold hart http ieee illustration images information intel internet irvine john kamath kirshner lead learning ligence machine matching mccallum mjolsness negative neural nigam notion optimal oriented orienting paper pappu parise pattern perform permutation permutations permuted point portals pose positive possible presence press problem problems processing rangara rate real references related rennie report retrieval second several seymore shapes shows smyth sons speci stork systems technical that there this transactions transformationinvariant university unknown unsupervised used usefully using values variables very well wiley with work world york http://www.hpl.hp.com/conferences/icml2003/papers/106.pdf 12 Semi-Supervised Learning of Mixture Models academic active adaptive additional algorithmic american analysis annals appear applications arti assoc asymptotic baluja based basford bayesian behavior berk berkeley biometrics biometrika both bruce carnegie castelli categories cation cial ciency cirelo classi clustering cohen collins colt combining comite comparison computation computer computers conditions conf continuations cooper corduneanu cover cozman data degrade dekker denis devroye different discriminant discrimination dissertation distributions documents econometrica edinburgh effect electrical employing engineering enhancing entity entropy errors estimates estimation examples experts exponential expression fabio face facial fifth florida freeman friedman from function ganesalingam geiger generative geoscience ghani gilleron goldman goldszmidt gyor help heterogeneous hosmer http huang huber hughes ieee improve improvement incorrect indiana inference information initial institute intelligence iterative itive jaakkola jebara john joint kharin kingdom kluver labeled labelled landgrebe large learning letouzey letters likelihood limiting linear lugosi machine marcel math mathematical maximum mccallum mclachlan meila mellon methods miller mining misspeci mitchell mitigating mixing mixture mixtures model modeling models multi named neill network neural nigam nonstandard nonsupervised normal number observations oles orientation outcome parameter parameters parametric pattern pennsylvania pensacola performance phenomenon poli poolbased posterior prior probabilistic probabilities probability problem problems processing provided publications publishers purdue ratsaby recognition reducing references relative remote report research robustness sample samples school science sebe seeger semi sensing shahshahani side singer size small society sons sources spectral springer stanford statistical statistics supervised symposium systems technical text theory three thrun trans trees types uncertainty unclassi under united university unknown unlabeled unlabelled unsupervised using uyar value venkatesh verlag vision when white wiley with workshop york zhang zhou http://www.hpl.hp.com/conferences/icml2003/papers/24.pdf 33 Probabilistic Classifiers and the Concepts they Recognize aaai affect agresti ambiguities american analysis applied approach arti association bayes bayesian becomes boolean canadian categorical cation center chapman charasteristics choosing cial classi conference connected data discrete domingos duda ecml ekin feature friedman from functions garg geiger goldszmidt hall hammer hart hellerstein holds ijcai incomplete intelligence into john kogan kohavi language learnability learning ling loss machine mathematics maximized multivariate naive natural network obtain optimality pattern pazzani performancetechnical plugging probabilistic problem proceedings references report research resolve rish roth satisfying scene schafer selection simple solving springer subset substituting such that thathachar this under understanding uniquely using watson wiley wrappers zero zhang zhao http://www.hpl.hp.com/conferences/icml2003/papers/352.pdf 94 SimpleSVM advances algorithm algorithms analysis applications bangalore bennett bhattacharyya blake cambridge cations cauwenberghs center cholesky cient classi cortes databases decremental design diagonal directsvm dissertation doctoral factorization fast fine fletcher ieee incremental india indian information institute iterative joachims john journal keerthi kernel kernels large learning life linear lkopf machine machines making mangasarian matrices matrix merz methods minimal modi multicategory murthy nearest networks neural optimization perceptron platt poggio point practical preparation press proceedings processing programming rank real references report repository representation research roobaert scale scheinberg science separation sequential shevade siam signal simple smola software sons support systems technical training using vapnik vector vishwanathan watson wiley with workshop york http://www.hpl.hp.com/conferences/icml2003/papers/54.pdf 39 Exploration in Metric State Spaces algorithm algorithms analysis approximate approximately approximation arti australian bartlett based baxter brafman carlos carnegie cial cient college comp comparison complexity computer conference continuous dale decision department deterministic direct directed discretizations econwpa estimation exploration factored function game general gradient guestrin horizon http icml ijcai intelligence iteration kakade kearns koenig koller langford large learning london mansour markov markovian mcallester mdps mellon methods model moore multidimensional national near nips nite optimal pages papers parti patrascu pittsburgh planning policy polynomial problems proceedings processes quasi random realtime references reinforcement relu report resolution rust sampling schuurmans science simmons singh solving sparse state statespaces sutton technical tennenholtz thesis thrun time university using variable with wustl http://www.hpl.hp.com/conferences/icml2003/papers/269.pdf 95 Testing Exchangeability On-Line acknowledgments aggregating algebra algorithm algorithmic almost annual appendix application applications assuming assumption based basic bbsrc best boosting calibrated cation classi coincides collection compression computational computer conditional conference coordinate dence derandomizing described detail details discus distribution doob each edition endix epsrc equation errn examples exchangeability expectation experiments expert first following follows formal forty foundations francisco freund from full further game gammerman generalization generated give given grant grateful have herbster http idea ieee immediately implicit implies index indicator induction inductive informative integer international intuition ject journal kaufmann known learning left lemma lemmas line logic machine machines making mateo means measurable ministerio modelling morgan most nance necessary need notation nouretdinov numbers obtain omitted only original other over paper partially particular permutation positive prediction probability proceedings processes proof prove proved random randomised randomness refer referees references result royal rtcm same satisfy saunders schapire schervish science second sequence sequences sets shafer shiryaev short similarly simple since sion sixteenth society spanish springer stand statistical statistics stochastic strategies suggestions supported surely symposium temporarily testing that their theorem theory therefore third thirteenth this tracking transducer trial underlying universal upper used usually vapnik variable variables version vovk warmuth well where which whose wiley will with work working workshop york http://www.hpl.hp.com/conferences/icml2003/papers/279.pdf 10 AWESOME: A General Multiagent Learning Algorithm that Converges in Self-Play and Learns a Best Resp onse Against Stationary Opp onents aaai acapulco action adaptive additional agent agents algorithm algorithms also analysis annual arti awesome because behavior boutilier bowling cahn calibrated canada cedure center chapter chastic cial claus colell commerce complexity conditional conference conitzer consistency convergence correlated desirable determine development diego discussion distributed dropped dynamics eared earlier eated econometrica economic eighteenth electronic endent equilibria equilibrium erative erent fast first foster framework from fudenb fundamentally further future games general give gradient guarantee guarantees hart hebrew ijcai important indep information intel international internet introduction israel itself jerusalem joint kearns leading learning levine ligence littman machine madison mansour markov mexico minimal modern multi multiagent nash national neural nips nite observed olynomial open opponents optimal order other papadimitriou play press previous procedures proceedings processing properies properties proving questions rate rationality references regarding reinforcement requirement research results sandholm satisfaction schedules serve several simple singh skeleton spaces stanford state stepping stoc stone strategies systems team technical techniques that theoretical theory there these this those time tracts uncertainty universal university used using valid valuable vancouver variable veloso version viewed vohra vrieze wang weiss wellman which whole with http://www.hpl.hp.com/conferences/icml2003/papers/257.pdf 75 Weighted Order Statistic Classifiers with Large Rank-Order Margin aaai acoustics admitting arce arti astola bagging based blake boosting breast breiman california cancer cannon chen cial circuits class classi computer conference coyle databases decomposition dept digital disease eleventh ensembles error expansion fifteenth figure fitch functions fundamentals gallagher gaussian gaussianexpansion general grove heart howse hush ieee information intel international ionosphere irvine iterations joint kuosmanen learned learning left ligence limit lter ltering lters machine manuscript margin maximizing median merz national negative networks neural nonlinear olean order osting ottom positive predictors press primal proceedings processing rank realizations recognition references regularization repository right sample schuurmans sciences scovel serial signal simple size sonar speech structure submitted systems test threshold training transactions university weighted weights york http://www.hpl.hp.com/conferences/icml2003/papers/105.pdf 85 TD(0) Converges Provably Faster than the Residual Gradient Algorithm above absolute abstraction according additionally adjoint advances advantageous algorithm algorithms already also analogous analysis applied approximation arbitrary arguments arti assertion assumption assumptions asymptotic asymptotically athena auxiliary axioms baird because before behaviour benchmark bertsekas between both calculations called cambridge case cases certain cial combinations common comparing conclude conclusions conditions conference conju consider converge convergence convergent converges corol corresponding demonstrated denote denotes depending determined deterministic directly discount dynamic each easily ectral eigenvalue eigenvalues endix equality equally equivalent equivalently erences euclidean every evidence example experimental extend factor fast faster following framework from frontiers function further gate generality given gradient greenbaum gridworld hall have hermitian holds horn identical implies induced information intel international involves iterative johnson just kaufmann know largest lary learning lemma ligence linear looking loss machine many mathematics matrices matrix mdps merke methods more morgan negative neural neuro nite nitions nonnegative nonsingular norm obtain obtained only optimal optimality ordered other paper parts policy positive possibly precup predict predicted prentice prepared press proceedings processing programming proof property prove proved putting radius random rate real recall references reinforcement remains representation residual respectively result rooms root roots same satis schoknecht scienti section seen semi semide show showed showing shown siam singh singular slower smaller smallest solving some special spectral square stated stochastic such sutton symmetric systems tabular taken temporal than that then theorem theoretical there therefore this thus together transpose tsitsiklis twelfth under university until used using usual value values varga vector versa vice walk where which with without yields http://www.hpl.hp.com/conferences/icml2003/papers/172.pdf 87 Text Bundling: Statistics-Based Data Reduction aaai agglomerative algorithm analysis annual arcing arti automatic based bayes berger bias boostexter boosting bottleneck breiman buckley burges california categorization cation cial classi clustering codes coding collection comparison comparitive conference correcting data department development discovery document domingos error european evaluation event examination experiments explorations feature feedback filtering fourteenth ghani hall hersh hickam http ijcai improving information insititute intelligence interactive international joachims know laboratory language large learning ledge leone machine machines massachusetts mccallum methods mining mladenic modeling models multiclass naive nation neural nigam nips ohsumed output panel pattern pedersen percent prentice probabilistic proceedings processing recognition references relevance rennie report research retrieval rifkin rocchio salton schapire selection seventeenth sigir sigkdd singer slonim smart statistical statistics stockholm study subsample subset support svmfu sweeden system systems technical technology tenth test text textlearning tfidf tishby toolkit tutorial university using variance vector when with workshop yang http://www.hpl.hp.com/conferences/icml2003/papers/152.pdf 72 Justification-based Multiagent Learning aamas able accuracies accuracy agents aggregate algorithm allowing also although among analysis approach aproach armengol arti bagging bartering based because become better boosting both breiman cascade case cases cation cations change chapman cial classi collaborate coming committee conclusions conference constructed cooper crucial data decision decreases degree dence depend dependent descriptions disagree distance distribution drops ecml engineering ensemble ensembles environment equal estimation european evaluate experiments fact favorable francisco freund from future gama generalization give good hall hansen harmelen have higher however hydrid icml ieee iiia importance improve independent individual induction institute intel intelligence into justi kaufmann keep know knowledge lazy learning ledge less level ligence local lopez machine mantaras measure method methods more moreover morgan much multiagent networks neural number obtain only ontanon parties pattern perrone plan plaza policies policy prediction predictions predictors presented prime proc provide provided redundancy references relational report research results review robust salamon same scenarios schapire section seen selection semantic several show shown since single skewed speech still symbolic system systems technical tested than that their them these third transactions tree trees tribute uniform used using view vision well when where which will with work works http://www.hpl.hp.com/conferences/icml2003/papers/119.pdf 6 Choosing b etween two learning algorithms based on calibrated tests accuracy advances algorithms approach approximate arti auckland avoid based bayesian bengio between bootstrap bouckaert calibrated cation choosing cial classi cohen comparing comparisons computation computer conference continuous cross data department dietterich discovery distributions eleventh error estimating estimation fourteenth francisco frank generalization implementations induction inference information intelligence international introduction java jensen john joint kaufmann knowledge kohavi langley learning machine mateo mcgraw mining mitchell model morgan multiple nadeau neural pages paper pitfalls practical press probability proceedings processing programs publishers quinlan recommended references salzberg science selection statistical statistics study supervised systems techniques tests tools uncertainty university validation waikato weber wild with witten working zealand http://www.hpl.hp.com/conferences/icml2003/papers/85.pdf 50 Finding Underlying Connections: A Fast Graph-Based Method for Link Analysis and Collaboration Queries advances advent anderson arti cambridge cial citation cohn conference connectivity content crouch data document elkana francisco friedman getoor hofmann hypertext indicators information intelligence international john joint kaufmann koller learning lederberg link logit malin merton metric missing model models morgan networks neural pfeffer press primer probabilistic proceedings processing publishers references relational science segal seventeenth small social sons structure systems taskar text thackray toward wasserman wiley york zuckerman http://www.hpl.hp.com/conferences/icml2003/papers/132.pdf 113 Semi-Supervised Learning Using Gaussian Fields and Harmonic Functions advances aistats algorithm america analysis approx arbitrary arti assoc back bayesian belief belkin bias bits blum boser bounds boykov cation chapelle chawla chung cial classi cluster clustering combinatorial computation conf conference correctness cuts data database denker diffusion digit discrete doyle edinburgh eigenvectors electric energy error fast freeman freund from functions gaussian generalization graph graphical graphs green grouping handwritten henderson howard hull ieee ijcai image imate information input intelligence international jaakkola jackel joint jordan journal kernels kondor labeled labelled lafferty large learning link machine malik manifold margin markov mathematical meila mincuts minimization models network networks neural niyogi normalized other partially pattern perceptron proc process processing propagation random recognition references report research schapire scholkopf seeger segmentation semi snell spaces spectral stability start structure supervised systems szummer technical text theory thresh topology trans transactions university unlabeled using veksler view walks weiss weston with zabih zheng http://www.hpl.hp.com/conferences/icml2003/papers/41.pdf 40 The Significance of Temporal-Difference Learning in Self-Play Training TD-rummy versus EVO-rummy andrew artificial backgammon barto bingen blair cambridge card clifford coevolution colorado communications conference decision difference doubleday encyclopedia fifth forthcoming games gammon gerald gibson hoyle http informatik international introduction issues land learning life machine master modern network neural player pollack practical press problems proceedings references reinforcement richard science sequential simulator snns springs stuttgart techniques temporal tesauro thesis training tuebingen university walter york http://www.hpl.hp.com/conferences/icml2003/papers/155.pdf 53 Improving accuracy and cost of two-class and multi-class probabilistic classifiers using ROC curves applied arti asker blake blockeel bostrom cial cient combining conquer considerations data databases discovery diterpene divide dzeroski effective elucidation fawcett first from graphs heidtke induction inductive intelligence international issue knowledge kremer learning logic machine merz mining notes order practical proceedings programming references report repository researcherstech rule schulze separate siems special spectra springer structure verlag wettschereck with workshop http://www.hpl.hp.com/conferences/icml2003/papers/140.pdf 110 Modified Logistic Regression: An Approximation to SVM and Its Applications in Large-Scale Text Categorization adaboost addison additive advances again algorithm algorithms analysis appear applications approaches arti based benchmark berlin boosting bregman burges cambridge carnegie categorization cation cial ciently class classi collection collins colt cominetti conclude conference convergence convex crisp data decreasing dept distances dussault ecml edition endix erty european examination experiments exponential fast features final first freund friedman functions ghani given gsvm hard hastie hypertext implies independent inequality information intel international introduction joachims john journal kernel large learning lebanon lewis ligent likelihood linear logistic luenberger machine machines make making many massachusetts maximum maxw maxz methods mika minimal minka models monotonical muller multi nature networks neural nips nishes nocedal nonlinear november numerical oles optimization osition osvm over pattern penalty platt points practical press princeton proc proceedings programming proof prop prove ratsch references regression regularized relevant report research retrieval rockafellar rose scale schapire scholkopf second sequential seventh show sigir since singer slattery solution sons springer stable stanford statistical statistics study such superlinear support symposium systems technical text that then theory this thus tibshirani training uniform uniqueness university using vapnik vector verlag view watkins wesley weston which wiley with wright yang york zhang http://www.hpl.hp.com/conferences/icml2003/papers/263.pdf 69 Optimal Reinsertion: A new search operator for accelerated and more accurate Bayesian network structure learning aaai alarm arti bayesian beinlich belief berlin blake buntine case chavez chickering cial classes complete conference cooper data databases elidan equivalence escaping european fisher friedman from herskovits html http induction inference intelligence kaufmann learning lenz local machine maxima medicine merz method mlearn mlrepository monitoring morgan nement network networks ninio perturbation portland probabilistic proc proceedings references repository schuurmans second seventh springer statistics structures study suermondt system techniques theory twelfth verlag with http://www.hpl.hp.com/conferences/icml2003/papers/208.pdf 83 Adaptive Overrelaxed Bound Optimization Methods acceleration advances aggregate alan algebra algorithm algorithms algorthms american analysers analysis analyze analyzers anand andrew appendix application applications approximation around arti association assume august basis bauer bayesian being bishop bounded cambridge carnegie case cccp cial ciently cients clearly coef comparison component components componentwise computation computational concave conditional conf conference conjugate consider converge convergence convex corresponding cruz daniel daphne data dayne della dempster denoting depends dept derive described description detailed different discussion each easily edition eigenvalues eigenvectors elds empirical entropy eric established estimation euclidean exactly expansion experiment extensions extraction factor factorization fastest features fernando first flannery following form freitag from fruitfly function gene genie geoffrey ghabramani ghahramani global gradient guarantee guaranteed have helmbold hinton http ieee incomplete inducing information intelligence international jamshidian jennrich john journal koller krishnan labeling lafferty laird language large largest lawrence lbnl learning letters likelihood linear linguistics local logistic machine march markov matrix maximum mccallum mclachlan mellon meng methods minka mixedorder mixture mixtures model models more moreover mortaza nakano naonori natural nature negative neighborhood networks neural norm note numerical objective objects obtained optimum orthonormal pages parameter particular parts pattern pereira pietra press principal probabilistic problem proc procedure proceedings processing proof proposition radius random rangarajan rate rates real recipes references regression report represented require results robert roweis rubin rules ryohei same santa saul scaled schapire sebastian second segmentation segmenting sequence series seung shrink shrinkage similar since singer size smallest smem society some spca spectral spirit statistical statistics stephen stretch subspace systems taylor technical teukolsky that therefore this thus tipping toronto transactions ueda uncertainty uniform uniquely university update upper using vector vetterling vicinity vincent volume warmuth when which wiley with within yoram yuille zero zoubin http://www.hpl.hp.com/conferences/icml2003/papers/298.pdf 98 Principled Metho ds for Advising Reinforcement Learning Agents ability accelerated adaptive advice advicetaking advising agent agents ahead algorithm already also alsr alter although among amount analogues application arbitrary architectures assumed athena automation barto begin bertsekas between bias bicycle bled boutilier capable combination conference connection control creating cybernetics design discussion does environmental eventually every feature features fifteenth framework function functions good hailu harada have ieee imitation implicit incorporated incorporating inherent initialization international into introduction invariance itself kaufmann kholsa knowledge learn learner learners learning linear look lyapunov machine maclin malak many mataric mentioned method morgan multiagent must neurodynamic ninth original perkins policy potential presented preserves press price proceedings programming quality randl recieving references reinforcement replacement represent representation representing reward rewards ride robot robotic russell safe scienti shaping shavlik should sixteenth skill slovenia sommer speci state stressed structures sutton systems terms that theory throughout transfer transformations tsitsiklis under uses using value values weight will with without would http://www.hpl.hp.com/conferences/icml2003/papers/215.pdf 76 Relativized Options: Cho osing the Right Transformation agre aitr algebraic appear arti based boundedparameter cial computation dean decision decomposition dietterich doya dynamic englewood equivalence everyday function givan greig hall hartmanis hierarchical institute intel intelligence katagiri kawato leach learning life ligence machines markov massachusetts maxq minimization model multiple neural notions prentice processes references reinforcement report research samejima sequential stearns structure technical technology theory value with