Table of Contents

Chairs' Welcome Message
Stan Matwin (Dalhousie University)

Shipeng Yu (LinkedIn)

Faisal Farooq (IBM)

KDD 2017 Organization

KDD 2017 Sponsors and Supporters

KDD 2017 Keynote Talks

KDD 2017 Research Papers (Poster Papers)

KDD 2017 Applied Invited Talks

KDD 2017 Applied Data Science Papers (Oral Papers)

KDD 2017 Panels

KDD 2017 Applied Data Science Papers (Poster Papers)

KDD 2017 Research Papers (Oral Papers)

KDD 2017 Keynote Talks

What's Fair? (Page 1)
Cynthia Dwork (Microsoft Research & Harvard University)

The Future of Data Integration (Page 3)
Renée J. Miller (University of Toronto)

Three Principles of Data Science: Predictability, Stability and Computability (Page 5)
Bin Yu (University of California, Berkeley)

KDD 2017 Applied Invited Talks

Foreword to the Applied Data Science - Invited Talks Track at KDD-2017 (Page 7)
Usama M. Fayyad (Open Insights)

Evangelos Simoudis (Synapse Partners)

Ashok Srivastava (Verizon)

More than the Sum of its Parts: Building Domino Data Lab (Page 9)
Eduardo Ariño de la Rubia (Domino Data Lab)

Mining Big Data in NeuroGenetics to Understand Muscular Dystrophy (Page 11)
Andy Berglund (University of Florida)

Industrial Machine Learning (Page 13)
Josh Bloom (GE)

Behavior Informatics to Discover Behavior Insight for Active and Tailored Client Management (Page 15)
Longbing Cao (University of Technology Sydney)

It Takes More than Math and Engineering to Hit the Bullseye with Data (Page 17)
Paritosh Desai (Target)

Planning and Learning under Uncertainty: Theory and Practice (Page 19)
Jonathan P. How (Massachusetts Institute of Technology)

Big Data in Climate: Opportunities and Challenges for Machine Learning (Page 21)
Anuj Karpatne (University of Minnesota)

Vipin Kumar (University of Minnesota)

Addressing Challenges with Big Data for Media Measurement (Page 23)
Mainak Mazumdar (Nielsen)

Machine Learning Software in Practice: Quo Vadis? (Page 25)
Szilárd Pafka (Epoch)

Designing AI at Scale to Power Everyday Life (Page 27)
Rajesh Parekh (Facebook)

Spaceborne Data Enters the Mainstream (Page 29)
David Potere (Tellus Laboratories)

KDD 2017 Panels

Benchmarks and Process Management in Data Science: Will We Ever Get Over the Mess? (Page 31)
Usama M. Fayyad (Open Insights)

Arno Candel (H2O.ai, Inc.)

Eduardo Ariño de la Rubia (Domino Data Lab)

Szilárd Pafka (Epoch)

Anthony Chong (IKASI)

Jeong-Yoon Lee (Microsoft)

The Future of Artificially Intelligent Assistants (Page 33)
Muthu Muthukrishnan (Rutgers University)

Andrew Tomkins (Google)

Larry Heck (Google)

Alborz Geramifard (Amazon)

Deepak Agarwal (LinkedIn)

KDD 2017 Research Papers (Oral Papers)

Learning Certifiably Optimal Rule Lists (Page 35)
Elaine Angelino (University of California, Berkeley)

Nicholas Larus-Stone (Harvard University)

Daniel Alabi (Harvard University)

Margo Seltzer (Harvard University)

Cynthia Rudin (Duke University)

Improved Degree Bounds and Full Spectrum Power Laws in Preferential Attachment Networks (Page 45)
Chen Avin (Ben Gurion University of the Negev)

Zvi Lotker (Ben Gurion University of the Negev)

Yinon Nahum (Weizmann Institute of Science)

David Peleg (Weizmann Institute of Science)

Unsupervised Network Discovery for Brain Imaging Data (Page 55)
Zilong Bai (University of California, Davis)

Peter Walker (Naval Medical Research Center)

Anna Tschiffely (Naval Medical Research Center)

Fei Wang (Cornell University)

Ian Davidson (University of California, Davis)

Patient Subtyping via Time-Aware LSTM Networks (Page 65)
Inci M. Baytas (Michigan State University)

Cao Xiao (IBM T. J. Watson Research Center)

Xi Zhang (Cornell University)

Fei Wang (Cornell University)

Anil K. Jain (Michigan State University)

Jiayu Zhou (Michigan State University)

Robust Top-k Multiclass SVM for Visual Category Recognition (Page 75)
Xiaojun Chang (Carnegie Mellon University)

Yao-Liang Yu (University of Waterloo)

Yi Yang (University of Technology Sydney)

KATE: K-Competitive Autoencoder for Text (Page 85)
Yu Chen (Rensselaer Polytechnic Institute)

Mohammed J. Zaki (Rensselaer Polytechnic Institute)

A Minimal Variance Estimator for the Cardinality of Big Data Set Intersection (Page 95)
Reuven Cohen (Technion)

Liran Katzir (Technion)

Aviv Yehezkel (Technion)

HyperLogLog Hyperextended: Sketches for Concave Sublinear Frequency Statistics (Page 105)
Edith Cohen (Google Research)

Fast Enumeration of Large k-Plexes (Page 115)
Alessio Conte (University of Pisa)

Donatella Firmani (Roma Tre University)

Caterina Mordente (Be Think Solve Execute)

Maurizio Patrignani (Roma Tre University)

Riccardo Torlone (Roma Tre University)

Matrix Profile V: A Generic Technique to Incorporate Domain Knowledge into Motif Discovery (Page 125)
Hoang Anh Dau (University of California Riverside)

Eamonn Keogh (University of California Riverside)

metapath2vec: Scalable Representation Learning for Heterogeneous Networks (Page 135)
Yuxiao Dong (Microsoft Research & University of Notre Dame)

Nitesh V. Chawla (University of Notre Dame)

Ananthram Swami (Army Research Laboratory)

Ego-Splitting Framework: from Non-Overlapping to Overlapping Clusters (Page 145)
Alessandro Epasto (Google Research)

Silvio Lattanzi (Google Research Zurich)

Renato Paes Leme (Google Research)

Contextual Motifs: Increasing the Utility of Motifs using Contextual Data (Page 155)
Ian Fox (University of Michigan)

Lynn Ang (University of Michigan)

Mamta Jaiswal (University of Michigan)

Rodica Pop-Busui (University of Michigan)

Jenna Wiens (University of Michigan)

Unsupervised P2P Rental Recommendations via Integer Programming (Page 165)
Yanjie Fu (Missouri University of Science and Technology)

Guannan Liu (Beihang University)

Mingfei Teng (Rutgers University)

Charu Aggarwal (IBM T. J. Watson Research Center)

The Co-Evolution Model for Social Network Evolving and Opinion Migration (Page 175)
Yupeng Gu (University of California, Los Angeles)

Yizhou Sun (University of California, Los Angeles)

Jianxi Gao (Northeastern University)

Groups-Keeping Solution Path Algorithm for Sparse Regression with Automatic Feature Grouping (Page 185)
Bin Gu (University of Texas at Arlington)

Guodong Liu (University of Texas at Arlington)

Heng Huang (University of Texas at Arlington)

Clustering Individual Transactional Data for Masses of Users (Page 195)
Riccardo Guidotti (ISTI-CNR & University of Pisa)

Anna Monreale (University of Pisa)

Mirco Nanni (ISTI-CNR)

Fosca Giannotti (ISTI-CNR)

Dino Pedreschi (University of Pisa)

Network Inference via the Time-Varying Graphical Lasso (Page 205)
David Hallac (Stanford University)

Youngsuk Park (Stanford University)

Stephen Boyd (Stanford University)

Jure Leskovec (Stanford University)

Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data (Page 215)
David Hallac (Stanford University)

Sagar Vare (Stanford University)

Stephen Boyd (Stanford University)

Jure Leskovec (Stanford University)

Efficient Correlated Topic Modeling with Topic Embedding (Page 225)
Junxian He (Carnegie Mellon University & Shanghai Jiao Tong University)

Zhiting Hu (Carnegie Mellon University & Petuum Inc.)

Taylor Berg-Kirkpatrick (Carnegie Mellon University)

Ying Huang (Shanghai Jiao Tong University)

Eric P. Xing (Carnegie Mellon University & Petuum Inc.)

Accelerating Innovation Through Analogy Mining (Page 235)
Tom Hope (Hebrew University of Jerusalem)

Joel Chan (Carnegie Mellon University)

Aniket Kittur (Carnegie Mellon University)

Dafna Shahaf (Hebrew University of Jerusalem)

Communication-Efficient Distributed Block Minimization for Nonlinear Kernel Machines (Page 245)
Cho-Jui Hsieh (University of California, Davis)

Si Si (Google Inc. & Google Research)

Inderjit S. Dhillon (University of Texas at Austin)

A Hierarchical Algorithm for Extreme Clustering (Page 255)
Ari Kobren (University of Massachusetts, Amherst)

Nicholas Monath (University of Massachusetts, Amherst)

Akshay Krishnamurthy (University of Massachusetts, Amherst)

Andrew McCallum (University of Massachusetts, Amherst)

Estimating Treatment Effect in the Wild via Differentiated Confounder Balancing (Page 265)
Kun Kuang (Tsinghua University)

Peng Cui (Tsinghua University)

Bo Li (Tsinghua University)

Meng Jiang (University of Notre Dame)

Shiqiang Yang (Tsinghua University)

The Selective Labels Problem: Evaluating Algorithmic Predictions in the Presence of Unobservables (Page 275)
Himabindu Lakkaraju (Stanford University)

Jon Kleinberg (Cornell University)

Jure Leskovec (Stanford University)

Jens Ludwig (University of Chicago)

Sendhil Mullainathan (Harvard University)

Constructivism Learning: A Learning Paradigm for Transparent Predictive Analytics (Page 285)
Xiaoli Li (University of Kansas)

Jun Huan (University of Kansas)

Is the Whole Greater Than the Sum of Its Parts? (Page 295)
Liangyue Li (Arizona State University)

Hanghang Tong (Arizona State University)

Yong Wang (Hong Kong University of Science and Technology)

Conglei Shi (IBM Research)

Nan Cao (Tongji University)

Norbou Buchler (US Army Research Laboratory)

Collaborative Variational Autoencoder for Recommender Systems (Page 305)
Xiaopeng Li (Hong Kong University of Science and Technology)

James She (Hong Kong University of Science and Technology)

Linearized GMM Kernels and Normalized Random Fourier Features (Page 315)
Ping Li (Rutgers University)

Discrete Content-aware Matrix Factorization (Page 325)
Defu Lian (University of Electronic Science and Technology of China)

Rui Liu (University of Electronic Science and Technology of China)

Yong Ge (University of Arizona)

Kai Zheng (University of Electronic Science and Technology of China)

Xing Xie (Microsoft Research)

Longbing Cao (University of Technology Sydney)

Effective and Real-time In-App Activity Analysis in Encrypted Internet Traffic Streams (Page 335)
Junming Liu (Rutgers University)

Yanjie Fu (Missouri University of Science and Technology)

Jingci Ming (Rutgers University)

Yong Ren (Futurewei Tech. Inc)

Leilei Sun (Tsinghua University)

Hui Xiong (Rutgers University)

Functional Annotation of Human Protein Coding Isoforms via Non-convex Multi-Instance Learning (Page 345)
Tingjin Luo (National University of Defense Technology)

Weizhong Zhang (Zhejiang University)

Shang Qiu (University of Michigan)

Yang Yang (Beihang University)

Dongyun Yi (National University of Defense Technology)

Guangtao Wang (University of Michigan)

Jieping Ye (University of Michigan)

Jie Wang (University of Michigan)

Discovering Reliable Approximate Functional Dependencies (Page 355)
Panagiotis Mandros (Max Planck Institute for Informatics & Saarland University)

Mario Boley (Max Planck Institute for Informatics & Saarland University)

Jilles Vreeken (Max Planck Institute for Informatics & Saarland University)

Towards an Optimal Subspace for K-Means (Page 365)
Dominik Mautz (Ludwig-Maximilians-Universität München)

Wei Ye (Ludwig-Maximilians-Universität München)

Claudia Plant (University of Vienna)

Christian Böhm (Ludwig-Maximilians-Universität München)

SPARTan: Scalable PARAFAC2 for Large & Sparse Data (Page 375)
Ioakeim Perros (Georgia Institute of Technology)

Evangelos E. Papalexakis (University of California, Riverside)

Fei Wang (Weill Cornell Medicine)

Richard Vuduc (Georgia Institute of Technology)

Elizabeth Searles (Children's Healthcare Of Atlanta)

Michael Thompson (Children's Healthcare Of Atlanta)

Jimeng Sun (Georgia Institute of Technology)

struc2vec: Learning Node Representations from Structural Identity (Page 385)
Leonardo F. R. Ribeiro (Federal University of Rio de Janeiro)

Pedro H. P. Saverese (Federal University of Rio de Janeiro)

Daniel R. Figueiredo (Federal University of Rio de Janeiro)

Similarity Forests (Page 395)
Saket Sathe (IBM T. J. Watson Research Center)

Charu C. Aggarwal (IBM T. J. Watson Research Center)

Online Ranking with Constraints: A Primal-Dual Algorithm and Applications to Web Traffic-Shaping (Page 405)
Parikshit Shah (Yahoo Research)

Akshay Soni (Yahoo Research)

Troy Chevalier (Yahoo Research)

On Finding Socially Tenuous Groups for Online Social Networks (Page 415)
Chih-Ya Shen (National Tsing Hua University)

Liang-Hao Huang (Academia Sinica)

De-Nian Yang (Academia Sinica)

Hong-Han Shuai (National Chiao Tung University)

Wang-Chien Lee (The Pennsylvania State University)

Ming-Syan Chen (National Taiwan University)

PReP: Path-Based Relevance from a Probabilistic Perspective in Heterogeneous Information Networks (Page 425)
Yu Shi (University of Illinois at Urbana-Champaign)

Po-Wei Chan (University of Illinois at Urbana-Champaign)

Honglei Zhuang (University of Illinois at Urbana-Champaign)

Huan Gui (University of Illinois at Urbana-Champaign)

Jiawei Han (University of Illinois at Urbana-Champaign)

Multi-Aspect Streaming Tensor Completion (Page 435)
Qingquan Song (Texas A&M University)

Xiao Huang (Texas A&M University)

Hancheng Ge (Texas A&M University)

James Caverlee (Texas A&M University)

Xia Hu (Texas A&M University & Texas A&M Engineering Experiment Station)

Scalable and Sustainable Deep Learning via Randomized Hashing (Page 445)
Ryan Spring (Rice University)

Anshumali Shrivastava (Rice University)

AnnexML: Approximate Nearest Neighbor Search for Extreme Multi-label Classification (Page 455)
Yukihiro Tagami (Yahoo Japan Corporation & Kyoto University)

Interpretable Predictions of Tree-based Ensembles via Actionable Feature Tweaking (Page 465)
Gabriele Tolomei (Yahoo Research)

Fabrizio Silvestri (Facebook)

Andrew Haines (Yahoo Research)

Mounia Lalmas (Yahoo Research)

Structural Deep Brain Network Mining (Page 475)
Shen Wang (University of Illinois at Chicago)

Lifang He (Shenzhen University)

Bokai Cao (University of Illinois at Chicago)

Chun-Ta Lu (University of Illinois at Chicago)

Philip S. Yu (University of Illinois at Chicago)

Ann B. Ragin (Northwestern University)

Randomized Feature Engineering as a Fast and Accurate Alternative to Kernel Methods (Page 485)
Suhang Wang (Arizona State University)

Charu Aggarwal (IBM T. J. Watson Research Center)

Huan Liu (Arizona State University)

Human Mobility Synchronization and Trip Purpose Detection with Mixture of Hawkes Processes (Page 495)
Pengfei Wang (Chinese Academy of Sciences)

Yanjie Fu (Missouri University of Science and Technology)

Guannan Liu (Beihang University)

Wenqing Hu (Missouri University of Science and Technology)

Charu Aggarwal (IBM T. J. Watson Research Center)

FORA: Simple and Effective Approximate Single-Source Personalized PageRank (Page 505)
Sibo Wang (University of Queensland & Nanyang Technological University)

Renchi Yang (Nanyang Technological University)

Xiaokui Xiao (Nanyang Technological University)

Zhewei Wei (Renmin University of China & Nanyang Technological University)

Yin Yang (Hamad Bin Khalifa University)

Large-scale Collaborative Ranking in Near-Linear Time (Page 515)
Liwei Wu (University of California, Davis)

Cho-Jui Hsieh (University of California, Davis)

James Sharpnack (University of California, Davis)

HoORaYs: High-order Optimization of Rating Distance for Recommender Systems (Page 525)
Jingwei Xu (Nanjing University)

Yuan Yao (Nanjing University)

Hanghang Tong (Arizona State University)

Xianping Tao (Nanjing University)

Jian Lu (Nanjing University)

Collaboratively Improving Topic Discovery and Word Embeddings by Coordinating Global and Local Contexts (Page 535)
Guangxu Xun (SUNY at Buffalo)

Yaliang Li (SUNY at Buffalo & Baidu Research Big Data Lab)

Jing Gao (SUNY at Buffalo)

Aidong Zhang (SUNY at Buffalo)

PPDsparse: A Parallel Primal-Dual Sparse Method for Extreme Classification (Page 545)
Ian E.H. Yen (Carnegie Mellon University)

Xiangru Huang (University of Texas at Austin)

Wei Dai (Carnegie Mellon University & Petuum Inc.)

Pradeep Ravikumar (Carnegie Mellon University)

Inderjit Dhillon (University of Texas at Austin)

Eric Xing (Carnegie Mellon University & Petuum Inc.)

Local Higher-Order Graph Clustering (Page 555)
Hao Yin (Stanford University)

Austin R. Benson (Stanford University)

Jure Leskovec (Stanford University)

David F. Gleich (Purdue University)

Long Short Memory Process: Modeling Growth Dynamics of Microscopic Social Connectivity (Page 565)
Chengxi Zang (Tsinghua University)

Peng Cui (Tsinghua University)

Christos Faloutsos (Carnegie Mellon University)

Wenwu Zhu (Tsinghua University)

Weisfeiler-Lehman Neural Machine for Link Prediction (Page 575)
Muhan Zhang (Washington University in St. Louis)

Yixin Chen (Washington University in St. Louis)

EmbedJoin: Efficient Edit Similarity Joins via Embeddings (Page 585)
Haoyu Zhang (Indiana University Bloomington)

Qin Zhang (Indiana University Bloomington)

TrioVecEvent: Embedding-Based Online Local Event Detection in Geo-Tagged Tweet Streams (Page 595)
Chao Zhang (University of Illinois at Urbana-Champaign)

Liyuan Liu (University of Illinois at Urbana-Champaign)

Dongming Lei (University of Illinois at Urbana-Champaign)

Quan Yuan (University of Illinois at Urbana-Champaign)

Honglei Zhuang (University of Illinois at Urbana-Champaign)

Tim Hanratty (U.S. Army Research Lab)

Jiawei Han (University of Illinois at Urbana-Champaign)

Graph Edge Partitioning via Neighborhood Heuristic (Page 605)
Chenzi Zhang (University of Hong Kong & Noah's Ark Lab)

Fan Wei (Stanford University)

Qin Liu (Huawei Noah's Ark Lab & Chinese University of Hong Kong)

Zhihao Gavin Tang (University of Hong Kong)

Zhenguo Li (Huawei Noah's Ark Lab)

Randomization or Condensation? Linear-Cost Matrix Sketching Via Cascaded Compression Sampling (Page 615)
Kai Zhang (Temple University)

Chuanren Liu (Drexel University)

Jie Zhang (Fudan University)

Hui Xiong (Rutgers University)

Eric Xing (Carnegie Mellon University)

Jieping Ye (University of Michigan, Ann Arbor)

Tracking the Dynamics in Crowdfunding (Page 625)
Hongke Zhao (University of Science and Technology of China)

Hefu Zhang (University of Science and Technology of China)

Yong Ge (University of Arizona)

Qi Liu (University of Science and Technology of China)

Enhong Chen (University of Science and Technology of China)

Huayu Li (University of North Carolina at Charlotte)

Le Wu (Hefei University of Technology)

Meta-Graph Based Recommendation Fusion over Heterogeneous Information Networks (Page 635)
Huan Zhao (Hong Kong University of Science and Technology)

Quanming Yao (Hong Kong University of Science and Technology)

Jianda Li (Hong Kong University of Science and Technology)

Yangqiu Song (Hong Kong University of Science and Technology)

Dik Lun Lee (Hong Kong University of Science and Technology)

Coresets for Kernel Regression (Page 645)
Yan Zheng (University of Utah)

Jeff M. Phillips (University of Utah)

A Local Algorithm for Structure-Preserving Graph Cut (Page 655)
Dawei Zhou (Arizona State University)

Si Zhang (Arizona State University)

Mehmet Yigit Yildirim (Arizona State University)

Scott Alcorn (Early Warnings LLC.)

Hanghang Tong (Arizona State University)

Hasan Davulcu (Arizona State University)

Jingrui He (Arizona State University)

Anomaly Detection with Robust Deep Autoencoders (Page 665)
Chong Zhou (Worcester Polytechnic Institute)

Randy C. Paffenroth (Worcester Polytechnic Institute)

KDD 2017 Research Papers (Poster Papers)

Effective Evaluation Using Logged Bandit Feedback from Multiple Loggers (Page 687)
Aman Agarwal (Cornell University)

Soumya Basu (Cornell University)

Tobias Schnabel (Cornell University)

Thorsten Joachims (Cornell University)

Tripoles: A New Class of Relationships in Time Series Data (Page 697)
Saurabh Agrawal (University of Minnesota)

Gowtham Atluri (University of Cincinnati)

Anuj Karpatne (University of Minnesota)

William Haltom (University of Minnesota)

Stefan Liess (University of Minnesota)

Snigdhansu Chatterjee (University of Minnesota)

Vipin Kumar (University of Minnesota)

Post Processing Recommender Systems for Diversity (Page 707)
Arda Antikacioglu (Carnegie Mellon University)

R. Ravi (Carnegie Mellon University)

Aspect Based Recommendations: Recommending Items with the Most Valuable Aspects Based on User Reviews (Page 717)
Konstantin Bauman (New York University)

Bing Liu (University of Illinois at Chicago)

Alexander Tuzhilin (New York University)

Bolt: Accelerated Data Mining with Fast Vector Compression (Page 727)
Davis W. Blalock (Massachusetts Institute of Technology)

John V. Guttag (Massachusetts Institute of Technology)

Robust Spectral Clustering for Noisy Data: Modeling Sparse Corruptions Improves Latent Embeddings (Page 737)
Aleksandar Bojchevski (Technical University of Munich)

Yves Matkovic (Technical University of Munich)

Stephan Günnemann (Technical University of Munich)

DeepMood: Modeling Mobile Phone Typing Dynamics for Mood Detection (Page 747)
Bokai Cao (University of Illinois at Chicago)

Lei Zheng (University of Illinois at Chicago)

Chenwei Zhang (University of Illinois at Chicago)

Philip S. Yu (Tsinghua University & University of Illinois at Chicago)

Andrea Piscitello (University of Illinois at Chicago)

John Zulueta (University of Illinois at Chicago)

Olu Ajilore (University of Illinois at Chicago)

Kelly Ryan (University of Michigan)

Alex D. Leow (University of Illinois at Chicago)

Fast Newton Hard Thresholding Pursuit for Sparsity Constrained Nonconvex Optimization (Page 757)
Jinghui Chen (University of Virginia)

Quanquan Gu (University of Virginia)

On Sampling Strategies for Neural Network-based Collaborative Filtering (Page 767)
Ting Chen (University of California, Los Angeles)

Yizhou Sun (University of California, Los Angeles)

Yue Shi (Yahoo! Research)

Liangjie Hong (Etsy Inc.)

Unsupervised Feature Selection in Signed Social Networks (Page 777)
Kewei Cheng (Arizona State University)

Jundong Li (Arizona State University)

Huan Liu (Arizona State University)

GRAM: Graph-based Attention Model for Healthcare Representation Learning (Page 787)
Edward Choi (Georgia Institute of Technology)

Mohammad Taha Bahadori (Georgia Institute of Technology)

Le Song (Georgia Institute of Technology)

Walter F. Stewart (Sutter Health)

Jimeng Sun (Georgia Institute of Technology)

Algorithmic Decision Making and the Cost of Fairness (Page 797)
Sam Corbett-Davies (Stanford University)

Emma Pierson (Stanford University)

Avi Feller (University of California, Berkeley)

Sharad Goel (Stanford University)

Aziz Huq (University of Chicago)

Structural Diversity and Homophily: A Study Across More Than One Hundred Big Networks (Page 807)
Yuxiao Dong (Microsoft Research & University of Notre Dame)

Reid A. Johnson (University of Notre Dame)

Jian Xu (University of Notre Dame)

Nitesh V. Chawla (University of Notre Dame)

Revisiting Power-law Distributions in Spectra of Real World Networks (Page 817)
Nicole Eikmeier (Purdue University)

David F. Gleich (Purdue University)

REMIX: Automated Exploration for Interactive Outlier Detection (Page 827)
Yanjie Fu (Missouri University of Science & Technology)

Charu Aggarwal (IBM T. J. Watson Research Center)

Srinivasan Parthasarathy (IBM T. J. Watson Research Center)

Deepak S. Turaga (IBM T. J. Watson Research Center)

Hui Xiong (Rutgers University)

Anarchists, Unite: Practical Entropy Approximation for Distributed Streams (Page 837)
Moshe Gabel (Technion - Israel Institute of Technology)

Daniel Keren (Haifa University)

Assaf Schuster (Technion - Israel Institute of Technology)

Recurrent Poisson Factorization for Temporal Recommendation (Page 847)
Seyed Abbas Hosseini (Sharif University of Technology)

Keivan Alizadeh (Sharif University of Technology)

Ali Khodadadi (Sharif University of Technology)

Ali Arabzadeh (Sharif University of Technology)

Mehrdad Farajtabar (Georgia Institute of Technology)

Hongyuan Zha (Georgia Institute of Technology)

Hamid R. Rabiee (Sharif University of Technology)

SPOT: Sparse Optimal Transformations for High Dimensional Variable Selection and Exploratory Regression Analysis (Page 857)
Qiming Huang (Purdue University)

Michael Zhu (Purdue University & Tsinghua University)

Incremental Dual-memory LSTM in Land Cover Prediction (Page 867)
Xiaowei Jia (University of Minnesota)

Ankush Khandelwal (University of Minnesota)

Guruprasad Nayak (University of Minnesota)

James Gerber (University of Minnesota)

Kimberly Carlson (University of Hawaii Manoa)

Paul West (University of Minnesota)

Vipin Kumar (University of Minnesota)

MetaPAD: Meta Pattern Discovery from Massive Text Corpora (Page 877)
Meng Jiang (University of Illinois at Urbana-Champaign)

Jingbo Shang (University of Illinois at Urbana-Champaign)

Taylor Cassidy (Army Research Laboratory)

Xiang Ren (University of Illinois at Urbana-Champaign)

Lance M. Kaplan (Army Research Laboratory)

Timothy P. Hanratty (Army Research Laboratory)

Jiawei Han (University of Illinois at Urbana-Champaign)

Federated Tensor Factorization for Computational Phenotyping (Page 887)
Yejin Kim (Pohang University of Science and Technology & University of California, San Diego)

Jimeng Sun (Georgia Institute of Technology)

Hwanjo Yu (Pohang University of Science and Technology)

Xiaoqian Jiang (University of California, San Diego)

Statistical Emerging Pattern Mining with Multiple Testing Correction (Page 897)
Junpei Komiyama (University of Tokyo)

Masakazu Ishihata (Hokkaido University)

Hiroki Arimura (Hokkaido University)

Takashi Nishibayashi (VOYAGE GROUP, Inc.)

Shin-ichi Minato (Hokkaido University)

Semi-Supervised Techniques for Mining Learning Outcomes and Prerequisites (Page 907)
Igor Labutov (Carnegie Mellon University)

Yun Huang (University of Pittsburgh)

Peter Brusilovsky (University of Pittsburgh)

Daqing He (University of Pittsburgh)

Prospecting the Career Development of Talents: A Survival Analysis Perspective (Page 917)
Huayu Li (University of North Carolina at Charlotte & Baidu Talent Intelligence Center)

Yong Ge (University of Arizona)

Hengshu Zhu (Baidu Talent Intelligence Center)

Hui Xiong (Rutgers University)

Hongke Zhao (University of Science and Technology of China)

A Context-aware Attention Network for Interactive Question Answering (Page 927)
Huayu Li (University of North Carolina, Charlotte)

Martin Renqiang Min (NEC Laboratories America)

Yong Ge (University of Arizona)

Asim Kadav (NEC Laboratories America)

Distributed Multi-Task Relationship Learning (Page 937)
Sulin Liu (Nanyang Technological University, Singapore)

Sinno Jialin Pan (Nanyang Technological University, Singapore)

Qirong Ho (Petuum, Inc.)

Point-of-Interest Demand Modeling with Human Mobility Patterns (Page 947)
Yanchi Liu (Rutgers University)

Chuanren Liu (Drexel University)

Xinjiang Lu (Northwestern Polytechnical University)

Mingfei Teng (Rutgers University)

Hengshu Zhu (Baidu Talent Intelligence Center)

Hui Xiong (Rutgers University)

Functional Zone Based Hierarchical Demand Prediction For Bike System Expansion (Page 957)
Junming Liu (Rutgers University)

Leilei Sun (Tsinghua University)

Qiao Li (Rutgers University)

Jingci Ming (Rutgers University)

Yanchi Liu (Rutgers University)

Hui Xiong (Rutgers University)

Unsupervised Discovery of Drug Side-Effects from Heterogeneous Data Sources (Page 967)
Fenglong Ma (SUNY Buffalo)

Chuishi Meng (SUNY Buffalo)

Houping Xiao (SUNY Buffalo)

Qi Li (SUNY Buffalo)

Jing Gao (SUNY Buffalo)

Lu Su (SUNY Buffalo)

Aidong Zhang (SUNY Buffalo)

Let's See Your Digits: Anomalous-State Detection using Benford's Law (Page 977)
Samuel Maurus (Technical University of Munich)

Claudia Plant (University of Vienna)

Mixture Factorized Ornstein-Uhlenbeck Processes for Time-Series Forecasting (Page 987)
Guo-Jun Qi (University of Central Florida)

Jiliang Tang (Michigan State University)

Jingdong Wang (Microsoft Research Asia and Hefei University of Technology)

Jiebo Luo (University of Rochester)

Automatic Synonym Discovery with Knowledge Bases (Page 997)
Meng Qu (University of Illinois at Urbana-Champaign)

Xiang Ren (University of Illinois at Urbana-Champaign)

Jiawei Han (University of Illinois at Urbana-Champaign)

An Alternative to NCD for Large Sequences, Lempel-Ziv Jaccard Distance (Page 1007)
Edward Raff (Laboratory for Physical Sciences)

Charles Nicholas (University of Maryland, Baltimore County)

Inferring the Strength of Social Ties: A Community-Driven Approach (Page 1017)
Polina Rozenshtein (Aalto University)

Nikolaj Tatti (Aalto University)

Aristides Gionis (Aalto University)

Detecting Network Effects: Randomizing Over Randomized Experiments (Page 1027)
Martin Saveski (Massachusetts Institute of Technology)

Jean Pouget-Abadie (Harvard University)

Guillaume Saint-Jacques (Massachusetts Institute of Technology)

Weitao Duan (LinkedIn)

Souvik Ghosh (LinkedIn)

Ya Xu (LinkedIn)

Edoardo M. Airoldi (Harvard University)

When is a Network a Network? Multi-Order Graphical Model Selection in Pathways and Temporal Networks (Page 1037)
Ingo Scholtes (ETH Zürich)

ReasoNet: Learning to Stop Reading in Machine Comprehension (Page 1047)
Yelong Shen (Microsoft Research)

Po-Sen Huang (Microsoft Research)

Jianfeng Gao (Microsoft Research)

Weizhu Chen (Microsoft Research)

DenseAlert: Incremental Dense-Subtensor Detection in Tensor Streams (Page 1057)
Kijung Shin (Carnegie Mellon University)

Bryan Hooi (Carnegie Mellon University)

Jisu Kim (Carnegie Mellon University)

Christos Faloutsos (Carnegie Mellon University)

Anomaly Detection in Streams with Extreme Value Theory (Page 1067)
Alban Siffer (IRISA)

Pierre-Alain Fouque (IRISA)

Alexandre Termier (IRISA)

Christine Largouet (IRISA)

Relay-Linking Models for Prominence and Obsolescence in Evolving Networks (Page 1077)
Mayank Singh (IIT Kharagpur)

Rajdeep Sarkar (IIT Kharagpur)

Pawan Goyal (IIT Kharagpur)

Animesh Mukherjee (IIT Kharagpur)

Soumen Chakrabarti (IIT Bombay)

PAMAE: Parallel k-Medoids Clustering with High Accuracy and Efficiency (Page 1087)
Hwanjun Song (Korea Advanced Institute of Science and Technology)

Jae-Gil Lee (Korea Advanced Institute of Science and Technology)

Wook-Shin Han (POSTECH)

Sparse Compositional Local Metric Learning (Page 1097)
Joseph St.Amand (University of Kansas)

Jun Huan (University of Kansas)

End-to-end Learning for Short Text Expansion (Page 1105)
Jian Tang (University of Michigan)

Yue Wang (University of Michigan)

Kai Zheng (University of California, Irvine)

Qiaozhu Mei (University of Michigan)

Construction of Directed 2K Graphs (Page 1115)
Bálint Tillman (University of California, Irvine)

Athina Markopoulou (University of California, Irvine)

Carter T. Butts (University of California, Irvine)

Minas Gjoka (University of California, Irvine)

Optimized Risk Scores (Page 1125)
Berk Ustun (Massachusetts Institute of Technology)

Cynthia Rudin (Duke University)

A Location-Sentiment-Aware Recommender System for Both Home-Town and Out-of-Town Users (Page 1135)
Hao Wang (Qihoo 360 Search Lab)

Yanmei Fu (Chinese Academy of Sciences & University of Chinese Academy of Sciences)

Qinyong Wang (University of Queensland)

Hongzhi Yin (University of Queensland)

Changying Du (Chinese Academy of Sciences)

Hui Xiong (Rutgers University)

Adversary Resistant Deep Neural Networks with an Application to Malware Detection (Page 1145)
Qinglong Wang (Pennsylvania State University & McGill University)

Wenbo Guo (The Pennsylvania State University)

Kaixuan Zhang (The Pennsylvania State University)

Alexander G. Ororbia II (The Pennsylvania State University)

Xinyu Xing (The Pennsylvania State University)

Xue Liu (McGill University)

C. Lee Giles (The Pennsylvania State University)

Multi-Modality Disease Modeling via Collective Deep Matrix Factorization (Page 1155)
Qi Wang (Michigan State University)

Mengying Sun (Michigan State University)

Liang Zhan (University of Wisconsin-Stout)

Paul Thompson (University of Southern California)

Shuiwang Ji (Washington State University)

Jiayu Zhou (Michigan State University)

Decomposed Normalized Maximum Likelihood Codelength Criterion for Selecting Hierarchical Latent Variable Models (Page 1165)
Tianyi Wu (University of Tokyo)

Shinya Sugawara (University of Tokyo)

Kenji Yamanishi (University of Tokyo)

Structural Event Detection from Log Messages (Page 1175)
Fei Wu (The Pennsylvania State University)

Pranay Anchuri (NEC Laboratories America)

Zhenhui Li (The Pennsylvania State University)

Retrospective Higher-Order Markov Processes for User Trails (Page 1185)
Tao Wu (Purdue University)

David F. Gleich (Purdue University)

Privacy-Preserving Distributed Multi-Task Learning with Asynchronous Updates (Page 1195)
Liyang Xie (Michigan State University)

Inci M. Baytas (Michigan State University)

Kaixiang Lin (Michigan State University)

Jiayu Zhou (Michigan State University)

Evaluating U.S. Electoral Representation with a Joint Statistical Model of Congressional Roll-Calls, Legislative Text, and Voter Registration Data (Page 1205)
Zhengming Xing (Criteo Labs)

Sunshine Hillygus (Duke University)

Lawrence Carin (Duke University)

Convex Factorization Machine for Toxicogenomics Prediction (Page 1215)
Makoto Yamada (RIKEN AIP, JST PRESTO)

Wenzhao Lian (Vicarious)

Amit Goyal (Yahoo Research)

Jianhui Chen (Microsoft)

Kishan Wimalawarne (Kyoto University)

Suleiman A. Khan (University of Helsinki)

Samuel Kaski (Aalto University)

Hiroshi Mamitsuka (Kyoto University & Aalto University)

Yi Chang (Huawei Research America)

Distributed Local Outlier Detection in Big Data (Page 1225)
Yizhou Yan (Worcester Polytechnic Institute)

Lei Cao (Massachusetts Institute of Technology)

Caitlin Kuhlman (Worcester Polytechnic Institute)

Elke Rundensteiner (Worcester Polytechnic Institute)

Scalable Top-n Local Outlier Detection (Page 1235)
Yizhou Yan (Worcester Polytechnic Institute)

Lei Cao (Massachusetts Institute of Technology)

Elke A. Rundensteiner (Worcester Polytechnic Institute)

Bridging Collaborative Filtering and Semi-Supervised Learning: A Neural Approach for POI Recommendation (Page 1245)
Carl Yang (University of Illinois, Urbana Champaign)

Lanxiao Bai (University of Illinois, Urbana Champaign)

Chao Zhang (University of Illinois, Urbana Champaign)

Quan Yuan (University of Illinois, Urbana Champaign)

Jiawei Han (University of Illinois, Urbana Champaign)

Multi-task Function-on-function Regression with Co-grouping Structured Sparsity (Page 1255)
Pei Yang (South China University of Technology & Arizona State University)

Qi Tan (South China Normal University)

Jingrui He (Arizona State University)

Learning from Labeled and Unlabeled Vertices in Networks (Page 1265)
Wei Ye (Ludwig-Maximilians-Universität München)

Linfei Zhou (Ludwig-Maximilians-Universität München)

Dominik Mautz (Ludwig-Maximilians-Universität München)

Claudia Plant (University of Vienna)

Christian Böhm (Ludwig-Maximilians-Universität München)

Small Batch or Large Batch? Gaussian Walk with Rebound Can Teach (Page 1275)
Peifeng Yin (IBM Almaden Research Center)

Ping Luo (Chinese Academy of Sciences & University of Chinese Academy of Sciences)

Taiga Nakamura (IBM Almaden Research Center)

Learning from Multiple Teacher Networks (Page 1285)
Shan You (Peking University)

Chang Xu (University of Sydney)

Chao Xu (Peking University)

Dacheng Tao (University of Sydney)

A Temporally Heterogeneous Survival Framework with Application to Social Behavior Dynamics (Page 1295)
Linyun Yu (Tsinghua University)

Peng Cui (Tsinghua University)

Chaoming Song (University of Miami)

Tianyang Zhang (Tsinghua University)

Shiqiang Yang (Tsinghua University)

Inductive Semi-supervised Multi-Label Learning with Co-Training (Page 1305)
Wang Zhan (Southeast University & Ministry of Education)

Min-Ling Zhang (Southeast University & Collaborative Innovation Center of Wireless Communications Technology)

LEAP: Learning to Prescribe Effective and Safe Treatment Combinations for Multimorbidity (Page 1315)
Yutao Zhang (Tsinghua University)

Robert Chen (Georgia Institute of Technology)

Jie Tang (Tsinghua University)

Walter F. Stewart (Sutter Health)

Jimeng Sun (Georgia Institute of Technology)

Visualizing Attributed Graphs via Terrain Metaphor (Page 1325)
Yang Zhang (Ohio State University)

Yusu Wang (Ohio State University)

Srinivasan Parthasarathy (Ohio State University)

Achieving Non-Discrimination in Data Release (Page 1335)
Lu Zhang (University of Arkansas)

Yongkai Wu (University of Arkansas)

Xintao Wu (University of Arkansas)

KDD 2017 Applied Data Science Papers (Oral Papers)

Using Convolutional Networks and Satellite Imagery to Identify Patterns in Urban Environments at a Large Scale (Page 1357)
Adrian Albert (Massachusetts Institute of Technology)

Jasleen Kaur (Philips Lighting Research)

Marta C. Gonzalez (Massachusetts Institute of Technology)

Luck is Hard to Beat: The Difficulty of Sports Prediction (Page 1367)
Raquel Y S Aoki (Universidade Federal de Minas Gerais)

Renato M. Assuncao (Universidade Federal de Minas Gerais)

Pedro O S Vaz de Melo (Universidade Federal de Minas Gerais)

Planning Bike Lanes based on Sharing-Bikes' Trajectories (Page 1377)
Jie Bao (Microsoft Research)

Tianfu He (Harbin Institute of Technology)

Sijie Ruan (Xidian University)

Yanhua Li (WPI)

Yu Zheng (Microsoft Research)

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)
Denis Baylor (Google Inc.)

Eric Breck (Google Inc.)

Heng-Tze Cheng (Google Inc.)

Noah Fiedel (Google Inc.)

Chuan Yu Foo (Google Inc.)

Zakaria Haque (Google Inc.)

Salem Haykal (Google Inc.)

Mustafa Ispir (Google Inc.)

Vihan Jain (Google Inc.)

Levent Koc (Google Inc.)

Chiu Yuen Koo (Google Inc.)

Lukasz Lew (Google Inc.)

Clemens Mewald (Google Inc.)

Akshay Naresh Modi (Google Inc.)

Neoklis Polyzotis (Google Inc.)

Sukriti Ramesh (Google Inc.)

Sudip Roy (Google Inc.)

Steven Euijong Whang (Google Inc.)

Martin Wicke (Google Inc.)

Jarek Wilkiewicz (Google Inc.)

Xin Zhang (Google Inc.)

Martin Zinkevich (Google Inc.)

LiJAR: A System for Job Application Redistribution towards Efficient Career Marketplace (Page 1397)
Fedor Borisyuk (LinkedIn Corporation)

Liang Zhang (LinkedIn Corporation)

Krishnaram Kenthapadi (LinkedIn Corporation)

A Data Science Approach to Understanding Residential Water Contamination in Flint (Page 1407)
Alex Chojnacki (University of Michigan)

Chengyu Dai (University of Michigan)

Arya Farahi (University of Michigan)

Guangsha Shi (University of Michigan)

Jared Webb (Brigham Young University)

Daniel T. Zhang (University of Michigan)

Jacob Abernethy (University of Michigan)

Eric Schwartz (University of Michigan)

Estimation of Recent Ancestral Origins of Individuals on a Large Scale (Page 1417)
Ross E. Curtis (AncestryDNA)

Ahna R. Girshick (AncestryDNA)

A Dirty Dozen: Twelve Common Metric Interpretation Pitfalls in Online Controlled Experiments (Page 1427)
Pavel Dmitriev (Microsoft Corporation)

Somit Gupta (Microsoft Corporation)

Dong Woo Kim (Microsoft Corporation)

Garnet Vaz (Microsoft Corporation)

A Century of Science: Globalization of Scientific Collaborations, Citations, and Innovations (Page 1437)
Yuxiao Dong (Microsoft Research)

Hao Ma (Microsoft Research)

Zhihong Shen (Microsoft Research)

Kuansan Wang (Microsoft Research)

FIRST: Fast Interactive Attributed Subgraph Matching (Page 1447)
Boxin Du (Arizona State University)

Si Zhang (Arizona State University)

Nan Cao (Tongji University)

Hanghang Tong (Arizona State University)

Prognosis and Diagnosis of Parkinson's Disease Using Multi-Task Learning (Page 1457)
Saba Emrani (SAS Institute Inc.)

Anya McGuirk (SAS Institute Inc.)

Wei Xiao (SAS Institute Inc.)

A Data Mining Framework for Valuing Large Portfolios of Variable Annuities (Page 1467)
Guojun Gan (University of Connecticut)

Jimmy Xiangji Huang (York University)

GELL: Automatic Extraction of Epidemiological Line Lists from Open Sources (Page 1477)
Saurav Ghosh (Virginia Tech)

Prithwish Chakraborty (Virginia Tech)

Bryan L. Lewis (Virginia Tech)

Maimuna S. Majumder (Massachusetts Institute of Technology & Boston Children's Hospital)

Emily Cohn (Boston Children's Hospital)

John S. Brownstein (Boston Children's Hospital)

Madhav V. Marathe (Virginia Tech)

Naren Ramakrishnan (Virginia Tech)

Google Vizier: A Service for Black-Box Optimization (Page 1487)
Daniel Golovin (Google Research)

Benjamin Solnik (Google Research)

Subhodeep Moitra (Google Research)

Greg Kochanski (Google Research)

John Karro (Google Research)

D. Sculley (Google Research)

Predicting Clinical Outcomes Across Changing Electronic Health Record Systems (Page 1497)
Jen J. Gong (Massachusetts Institute of Technology)

Tristan Naumann (Massachusetts Institute of Technology)

Peter Szolovits (Massachusetts Institute of Technology)

John V. Guttag (Massachusetts Institute of Technology)

HinDroid: An Intelligent Android Malware Detection System Based on Structured Heterogeneous Information Network (Page 1507)
Shifu Hou (West Virginia University)

Yanfang Ye (West Virginia University)

Yangqiu Song (HKUST)

Melih Abdulhayoglu (Comodo Security Solutions, Inc.)

Peeking at A/B Tests: Why it matters, and what to do about it (Page 1517)
Ramesh Johari (Stanford University)

Pete Koomen (Optimizely, Inc.)

Leonid Pekelis (Optimizely, Inc.)

David Walsh (Stanford University)

PNP: Fast Path Ensemble Method for Movie Design (Page 1527)
Danai Koutra (University of Michigan)

Abhilash Dighe (University of Michigan)

Smriti Bhagat (Facebook & Technicolor)

Udi Weinsberg (Facebook & Technicolor)

Stratis Ioannidis (Northeastern University)

Christos Faloutsos (Carnegie Mellon University)

Jean Bolot (Technicolor)

Pharmacovigilance via Baseline Regularization with Large-Scale Longitudinal Observational Data (Page 1537)
Zhaobin Kuang (University of Wisconsin-Madison)

Peggy Peissig (Marshfield Clinic)

Vitor Santos Costa (Universidade do Porto)

Richard Maclin (University of Minnesota-Duluth)

David Page (University of Wisconsin-Madison)

FLAP: An End-to-End Event Log Analysis Platform for System Management (Page 1547)
Tao Li (Nanjing University of Posts and Telecommunications)

Yexi Jiang (Florida International University)

Chunqiu Zeng (Florida International University)

Bin Xia (Nanjing University of Posts and Telecommunications)

Zheng Liu (Nanjing University of Posts and Telecommunications)

Wubai Zhou (Florida International University)

Xiaolong Zhu (Florida International University)

Wentao Wang (Florida International University)

Liang Zhang (Huawei Nanjing Research and Development Center)

Jun Wu (Huawei Nanjing Research and Development Center)

Li Xue (Huawei Nanjing Research and Development Center)

Dewei Bao (Huawei Nanjing Research and Development Center)

Cascade Ranking for Operational E-commerce Search (Page 1557)
Shichen Liu (Alibaba Group)

Fei Xiao (Alibaba Group)

Wenwu Ou (Alibaba Group)

Luo Si (Alibaba Group)

Developing a Comprehensive Framework for Multimodal Feature Extraction (Page 1567)
Quinten McNamara (University of Texas at Austin)

Alejandro De La Vega (University of Texas at Austin)

Tal Yarkoni (University of Texas at Austin)

Deep Choice Model Using Pointer Networks for Airline Itinerary Prediction (Page 1575)
Alejandro Mottini (Amadeus SAS)

Rodrigo Acuna-Agost (Amadeus SAS)

Compass: Spatio Temporal Sentiment Analysis of US Election: What Twitter Says! (Page 1585)
Debjyoti Paul (University of Utah)

Feifei Li (University of Utah)

Murali Krishna Teja (University of Utah)

Xin Yu (University of Utah)

Richie Frost (University of Utah)

Backpage and Bitcoin: Uncovering Human Traffickers (Page 1595)
Rebecca S. Portnoff (University of California, Berkeley)

Danny Yuxing Huang (University of California, San Diego)

Periwinkle Doerfler (New York University)

Sadia Afroz (ICSI)

Damon McCoy (New York University)

"Not All Passes Are Created Equal:" Objectively Measuring The Risk and Reward of Passes in Soccer from Tracking Data (Page 1605)
Paul Power (STATS)

Hector Ruiz (STATS)

Xinyu Wei (STATS)

Patrick Lucey (STATS)

MARAS: Signaling Multi-Drug Adverse Reactions (Page 1615)
Xiao Qin (Worcester Polytechnic Institute)

Tabassum Kakar (Worcester Polytechnic Institute)

Susmitha Wunnava (Worcester Polytechnic Institute)

Elke A. Rundensteiner (Worcester Polytechnic Institute)

Lei Cao (Massachusetts Institute of Technology)

A Practical Exploration System for Search Advertising (Page 1625)
Parikshit Shah (Yahoo Research)

Ming Yang (Yahoo)

Sachidanand Alle (Yahoo)

Adwait Ratnaparkhi (Yahoo Research)

Ben Shahshahani (Yahoo Research)

Rohit Chandra (Yahoo)

MOLIERE: Automatic Biomedical Hypothesis Generation System (Page 1633)
Justin Sybrandt (Clemson University)

Michael Shtutman (University of South Carolina)

Ilya Safro (Clemson University)

Quick Access: Building a Smart Experience for Google Drive (Page 1643)
Sandeep Tata (Google USA)

Alexandrin Popescul (Google USA)

Marc Najork (Google USA)

Mike Colagrosso (Google USA)

Julian Gibbons (Google Australia)

Alan Green (Google Australia)

Alexandre Mah (Google Australia)

Michael Smith (Google Australia)

Divanshu Garg (Google Australia)

Cayden Meyer (Google Australia)

Reuben Kan (Google Australia)

The Simpler The Better: A Unified Approach to Predicting Original Taxi Demands based on Large-Scale Online Platforms (Page 1653)
Yongxin Tong (Beihang University)

Yuqiang Chen (4Paradigm Inc.)

Zimu Zhou (ETH Zurich)

Lei Chen (Hong Kong University of Science and Technology)

Jie Wang (Didi Research Institute)

Qiang Yang (4Paradigm Inc. & Hong Kong University of Science and Technology)

Jieping Ye (Didi Research Institute)

Weifeng Lv (Beihang University)

DeepSD: Generating High Resolution Climate Change Projections through Single Image Super-Resolution (Page 1663)
Thomas Vandal (Northeastern University)

Evan Kodra (risQ Inc.)

Sangram Ganguly (Bay Area Environmental Research Institute / NASA Ames Research Center)

Andrew Michaelis (University Corporation, Monterey Bay)

Ramakrishna Nemani (NASA Advanced Supercomputing Division / NASA Ames Research Center)

Auroop R. Ganguly (Northeastern University)

No Longer Sleeping with a Bomb: A Duet System for Protecting Urban Safety from Dangerous Goods (Page 1673)
Jingyuan Wang (Beihang University)

Chao Chen (Beihang University)

Junjie Wu (Beihang University)

Zhang Xiong (Beihang University)

A Quasi-experimental Estimate of the Impact of P2P Transportation Platforms on Urban Consumer Patterns (Page 1683)
Zhe Zhang (Carnegie Mellon University)

Beibei Li (Carnegie Mellon University)

KunPeng: Parameter Server based Distributed Learning Systems and Its Applications in Alibaba and Ant Financial (Page 1693)
Jun Zhou (Ant Financial Services Group)

Xiaolong Li (Ant Financial Services Group)

Peilin Zhao (Ant Financial Services Group)

Chaochao Chen (Ant Financial Services Group)

Longfei Li (Ant Financial Services Group)

Xinxing Yang (Ant Financial Services Group)

Qing Cui (Alibaba Cloud)

Jin Yu (Alibaba Cloud)

Xu Chen (Alibaba Cloud)

Yi Ding (Alibaba Cloud)

Yuan Alan Qi (Ant Financial Services Group)

Deep Embedding Forest: Forest-based Serving with Deep Embedding Features (Page 1703)
Jie Zhu (Microsoft Corporation)

Ying Shan (Microsoft Corporation)

JC Mao (Microsoft Corporation)

Dong Yu (Microsoft Corporation)

Holakou Rahmanian (University of California, Santa Cruz)

Yi Zhang (Microsoft Corporation)

KDD 2017 Applied Data Science Papers (Poster Papers)

A Practical Algorithm for Solving the Incoherence Problem of Topic Models In Industrial Applications (Page 1713)
Amr Ahmed (Google Research)

James Long (Google Research)

Daniel Silva (Google Research)

Yuan Wang (Google Research)

Machine Learning for Encrypted Malware Traffic Classification: Accounting for Noisy Labels and Non-Stationarity (Page 1723)
Blake Anderson (Cisco Systems, Inc.)

David McGrew (Cisco Systems, Inc.)

Extremely Fast Decision Tree Mining for Evolving Data Streams (Page 1733)
Albert Bifet (Telecom ParisTech)

Jiajin Zhang (HUAWEI)

Wei Fan (Baidu Research Big Data Lab)

Cheng He (HUAWEI)

Jianfeng Zhang (HUAWEI)

Jianfeng Qian (Columbia University)

Geoff Holmes (University of Waikato)

Bernhard Pfahringer (University of Waikato)

Real-Time Optimization of Web Publisher RTB Revenues (Page 1743)
Pedro Chahuara (XRCE)

Nicolas Grislain (AlephD)

Gregoire Jauvion (AlephD)

Jean-Michel Renders (XRCE)

Customer Lifetime Value Prediction Using Embeddings (Page 1753)
Benjamin Paul Chamberlain (Imperial College London)

Ângelo Cardoso (ASOS.com)

C.H. Bryan Liu (ASOS.com)

Roberto Pagliari (ASOS.com)

Marc Peter Deisenroth (Imperial College London)

TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks (Page 1763)
Heng-Tze Cheng (Google, Inc.)

Zakaria Haque (Google, Inc.)

Lichan Hong (Google, Inc.)

Mustafa Ispir (Google, Inc.)

Clemens Mewald (Google, Inc.)

Illia Polosukhin (Google, Inc.)

Georgios Roumpos (Google, Inc.)

D Sculley (Google, Inc.)

Jamie Smith (Google, Inc.)

David Soergel (Google, Inc.)

Yuan Tang (Uptake Technologies, Inc.)

Philipp Tucker (Google, Inc.)

Martin Wicke (Google, Inc.)

Cassandra Xia (Google, Inc.)

Jianwei Xie (Google, Inc.)

Learning Tree-Structured Detection Cascades for Heterogeneous Networks of Embedded Devices (Page 1773)
Hamid Dadkhahi (University of Massachusetts, Amherst)

Benjamin M. Marlin (University of Massachusetts, Amherst)

AESOP: Automatic Policy Learning for Predicting and Mitigating Network Service Impairments (Page 1783)
Supratim Deb (AT&T Labs)

Zihui Ge (AT&T Labs)

Sastry Isukapalli (AT&T Labs)

Sarat Puthenpura (AT&T Labs)

Shobha Venkataraman (AT&T Labs)

He Yan (AT&T Labs)

Jennifer Yates (AT&T Labs)

Automated Categorization of Onion Sites for Analyzing the Darkweb Ecosystem (Page 1793)
Shalini Ghosh (SRI International)

Ariyam Das (University of California, Los Angeles)

Phil Porras (SRI International)

Vinod Yegneswaran (SRI International)

Ashish Gehani (SRI International)

Toward Automated Fact-Checking: Detecting Check-worthy Factual Claims by ClaimBuster (Page 1803)
Naeemul Hassan (University of Mississippi)

Fatma Arslan (University of Texas at Arlington)

Chengkai Li (University of Texas at Arlington)

Mark Tremayne (University of Texas at Arlington)

An Efficient Bandit Algorithm for Realtime Multivariate Optimization (Page 1813)
Daniel N. Hill (Amazon.com, Inc.)

Houssam Nassif (Amazon.com, Inc.)

Yi Liu (Amazon.com, Inc.)

Anand Iyer (Amazon.com, Inc.)

S.V.N. Vishwanathan (Amazon.com, Inc. & University of California, Santa Cruz)

Large Scale Sentiment Learning with Limited Labels (Page 1823)
Vasileios Iosifidis (Leibniz University Hanover & L3S Research Center)

Eirini Ntoutsi (Leibniz University Hanover & L3S Research Center)

Optimization Beyond Prediction: Prescriptive Price Optimization (Page 1833)
Shinji Ito (NEC Corporation)

Ryohei Fujimaki (NEC Corporation)

Finding Precursors to Anomalous Drop in Airspeed During a Flight's Takeoff (Page 1843)
Vijay Manikandan Janakiraman (USRA / NASA Ames Research Center)

Bryan Matthews (USRA / NASA Ames Research Center)

Nikunj Oza (NASA Ames Research Center)

Ad Serving with Multiple KPIs (Page 1853)
Brendan Kitts (PrecisionDemand)

Michael Krishnan (Adap.tv)

Ishadutta Yadav (Adap.tv)

Yongbo Zeng (Adap.tv)

Garrett Badeau (Adap.tv)

Andrew Potter (AOL Platforms)

Sergey Tolkachov (AOL Platforms)

Ethan Thornburg (AOL Platforms)

Satyanarayana Reddy Janga (AOL Platforms)

Discovering Pollution Sources and Propagation Patterns in Urban Area (Page 1863)
Xiucheng Li (Nanyang Technological University)

Yun Cheng (Air Scientific)

Gao Cong (Nanyang Technological University)

Lisi Chen (Hong Kong Baptist University)

Discovering Enterprise Concepts Using Spreadsheet Tables (Page 1873)
Keqian Li (University of California, Santa Barbara & Microsoft Research)

Yeye He (Microsoft Research)

Kris Ganjam (Microsoft Research)

Supporting Employer Name Normalization at both Entity and Cluster Level (Page 1883)
Qiaoling Liu (CareerBuilder LLC)

Faizan Javed (CareerBuilder LLC)

Vachik S. Dave (Indiana University - Purdue University Indianapolis & CareerBuilder LLC)

Ankita Joshi (University of Georgia & CareerBuilder LLC)

BDT: Gradient Boosted Decision Tables for High Accuracy and Scoring Efficiency (Page 1893)
Yin Lou (Airbnb Incorporation)

Mikhail Obukhov (LinkedIn Corporation)

Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks (Page 1903)
Fenglong Ma (SUNY Buffalo & Xerox)

Radha Chitta (Conduent Labs US)

Jing Zhou (Conduent Labs US)

Quanzeng You (University of Rochester)

Tong Sun (United Technologies Research Center & Xerox)

Jing Gao (SUNY Buffalo)

Internet Device Graphs (Page 1913)
Matthew Malloy (comScore)

Paul Barford (comScore & University of Wisconsin)

Enis Ceyhun Alp (comScore & University of Wisconsin)

Jonathan Koller (comScore)

Adria Jewell (comScore)

RUSH! Targeted Time-limited Coupons via Purchase Forecasts (Page 1923)
Emaad Manzoor (Carnegie Mellon University)

Leman Akoglu (Carnegie Mellon University)

Embedding-based News Recommendation for Millions of Users (Page 1933)
Shumpei Okura (Yahoo Japan Corporation)

Yukihiro Tagami (Yahoo Japan Corporation)

Shingo Ono (Yahoo Japan Corporation)

Akira Tajima (Yahoo Japan Corporation)

Learning to Count Mosquitoes for the Sterile Insect Technique (Page 1943)
Yaniv Ovadia (Google Inc.)

Yoni Halpern (Google Inc.)

Dilip Krishnan (Google Inc.)

Josh Livni (Verily Inc.)

Daniel Newburger (Verily Inc.)

Ryan Poplin (Verily Inc.)

Tiantian Zha (Verily Inc.)

D. Sculley (Google Inc.)

An Intelligent Customer Care Assistant System for Large-Scale Cellular Network Diagnosis (Page 1951)
Lujia Pan (Huawei Technologies)

Jianfeng Zhang (Huawei Technologies)

Patrick P. C. Lee (Chinese University of Hong Kong)

Hong Cheng (Chinese University of Hong Kong)

Cheng He (Huawei Technologies)

Caifeng He (Huawei Technologies)

Keli Zhang (Huawei Technologies)

Deep Design: Product Aesthetics for Heterogeneous Markets (Page 1961)
Yanxin Pan (University of Michigan)

Alexander Burnap (University of Michigan)

Jeffrey Hartley (General Motors)

Richard Gonzalez (University of Michigan)

Panos Y. Papalambros (University of Michigan)

Collecting and Analyzing Millions of mHealth Data Streams (Page 1971)
Tom Quisel (Evidation Health, Inc.)

Luca Foschini (Evidation Health, Inc.)

Alessio Signorini (Evidation Health, Inc.)

David C. Kale (University of Southern California)

Dispatch with Confidence: Integration of Machine Learning, Optimization and Simulation for Open Pit Mines (Page 1981)
Kosta Ristovski (Hitachi America Ltd)

Chetan Gupta (Hitachi America Ltd)

Kunihiko Harada (Hitachi America Ltd)

Hsiu-Khuern Tang (Hitachi America Ltd)

"The Leicester City Fairytale?": Utilizing New Soccer Analytics Tools to Compare Performance in the 15/16 & 16/17 EPL Seasons (Page 1991)
Hector Ruiz (STATS)

Paul Power (STATS)

Xinyu Wei (STATS)

Patrick Lucey (STATS)

Matching Restaurant Menus to Crowdsourced Food Data: A Scalable Machine Learning Approach (Page 2001)
Hesam Salehian (Under Armour Connected Fitness)

Patrick Howell (Under Armour Connected Fitness)

Chul Lee (Under Armour Connected Fitness)

The Fake vs Real Goods Problem: Microscopy and Machine Learning to the Rescue (Page 2011)
Ashlesh Sharma (Entrupy Inc)

Vidyuth Srinivasan (Entrupy Inc)

Vishal Kanchan (Entrupy Inc)

Lakshminarayanan Subramanian (Entrupy Inc & New York University)

Automatic Application Identification from Billions of Files (Page 2021)
Kyle Soska (Carnegie Mellon University)

Chris Gates (Symantec)

Kevin A. Roundy (Symantec)

Nicolas Christin (Carnegie Mellon University)

Learning to Generate Rock Descriptions from Multivariate Well Logs with Hierarchical Attention (Page 2031)
Bin Tong (Hitachi, Ltd.)

Martin Klinkigt (Hitachi, Ltd.)

Makoto Iwayama (Hitachi, Ltd.)

Toshihiko Yanase (Hitachi, Ltd.)

Yoshiyuki Kobayashi (Hitachi, Ltd.)

Anshuman Sahu (Hitachi America, Ltd.)

Ravigopal Vennelakanti (Hitachi America, Ltd.)

Multi-view Learning over Retinal Thickness and Visual Sensitivity on Glaucomatous Eyes (Page 2041)
Toshimitsu Uesaka (The University of Tokyo)

Kai Morino (The University of Tokyo)

Hiroki Sugiura (The University of Tokyo)

Taichi Kiwaki (The University of Tokyo)

Hiroshi Murata (The University of Tokyo)

Ryo Asaoka (The University of Tokyo)

Kenji Yamanishi (The University of Tokyo)

Dynamic Attention Deep Model for Article Recommendation by Learning Human Editors' Demonstration (Page 2051)
Xuejian Wang (Shanghai Jiao Tong University)

Lantao Yu (Shanghai Jiao Tong University)

Kan Ren (Shanghai Jiao Tong University)

Guanyu Tao (ULU Technologies Inc.)

Weinan Zhang (Shanghai Jiao Tong University)

Yong Yu (Shanghai Jiao Tong University)

Jun Wang (University College London)

A Hybrid Framework for Text Modeling with Convolutional RNN (Page 2061)
Chenglong Wang (Alibaba Group)

Feijun Jiang (Alibaba Group)

Hongxia Yang (Alibaba Group)

Formative Essay Feedback Using Predictive Scoring Models (Page 2071)
Bronwyn Woods (Turnitin)

David Adamson (Turnitin)

Shayne Miel (Turnitin)

Elijah Mayfield (Turnitin)

Learning Temporal State of Diabetes Patients via Combining Behavioral and Demographic Data (Page 2081)
Houping Xiao (SUNY Buffalo & IBM T.J. Watson Research Center)

Jing Gao (SUNY Buffalo)

Long Vu (IBM T.J. Watson Research Center)

Deepak S. Turaga (IBM T.J. Watson Research Center)

Local Algorithm for User Action Prediction Towards Display Ads (Page 2091)
Hongxia Yang (Alibaba Group)

Yada Zhu (IBM Research)

Jingrui He (Arizona State University)

Visual Search at eBay (Page 2101)
Fan Yang (eBay Inc.)

Ajinkya Kale (eBay Inc.)

Yury Bubnov (eBay Inc.)

Leon Stein (eBay Inc.)

Qiaosong Wang (eBay Inc.)

Hadi Kiapour (eBay Inc.)

Robinson Piramuthu (eBay Inc.)

A Data-driven Process Recommender Framework (Page 2111)
Sen Yang (Rutgers University)

Xin Dong (Rutgers University)

Leilei Sun (Tsinghua University)

Yichen Zhou (Rutgers University)

Richard A. Farneth (Children's National Medical Center)

Hui Xiong (Rutgers University)

Randall S. Burd (Children's National Medical Center)

Ivan Marsic (Rutgers University)

Predicting Optimal Facility Location without Customer Locations (Page 2121)
Emre Yilmaz (Bilkent University)

Sanem Elbasi (Bilkent University)

Hakan Ferhatosmanoglu (Bilkent University)

DeepProbe: Information Directed Sequence Understanding and Chatbot Design via Recurrent Neural Networks (Page 2131)
Zi Yin (Stanford University & Microsoft)

Keng-hao Chang (Microsoft)

Ruofei Zhang (Microsoft)

Stock Price Prediction via Discovering Multi-Frequency Trading Patterns (Page 2141)
Liheng Zhang (University of Central Florida)

Charu Aggarwal (IBM T. J. Watson Research Center)

Guo-Jun Qi (University of Central Florida)

A Taxi Order Dispatch Model based On Combinatorial Optimization (Page 2151)
Lingyu Zhang (Didi Research Institute, Didi Chuxing)

Tao Hu (Didi Research Institute, Didi Chuxing)

Yue Min (Didi Research Institute, Didi Chuxing)

Guobin Wu (Didi Research Institute, Didi Chuxing)

Junying Zhang (Didi Research Institute, Didi Chuxing)

Pengcheng Feng (Didi Research Institute, Didi Chuxing)

Pinghua Gong (Didi Research Institute, Didi Chuxing)

Jieping Ye (Didi Research Institute, Didi Chuxing)

Contextual Spatial Outlier Detection with Metric Learning (Page 2161)
Guanjie Zheng (Pennsylvania State University)

Susan L. Brantley (Pennsylvania State University)

Thomas Lauvaux (Pennsylvania State University)

Zhenhui Li (Pennsylvania State University)

Resolving the Bias in Electronic Medical Records (Page 2171)
Kaiping Zheng (National University of Singapore)

Jinyang Gao (National University of Singapore)

Kee Yuan Ngiam (National University Health System)

Beng Chin Ooi (National University of Singapore)

Wei Luen James Yip (National University Health System)

STAR: A System for Ticket Analysis and Resolution (Page 2181)
Wubai Zhou (Florida International University)

Wei Xue (Florida International University)

Ramesh Baral (Florida International University)

Qing Wang (Florida International University)

Chunqiu Zeng (Florida International University)

Tao Li (Florida International University)

Jian Xu (Nanjing University of Science and Technology)

Zheng Liu (Nanjing University of Posts and Telecommunications)

Larisa Shwartz (IBM T.J. Watson Research Center)

Genady Ya. Grabarnik (St. John's University, Queens)

Optimized Cost per Click in Taobao Display Advertising (Page 2191)
Han Zhu (Alibaba Group)

Junqi Jin (Alibaba Group)

Chang Tan (Alibaba Group)

Fei Pan (Alibaba Group)

Yifan Zeng (Alibaba Group)

Han Li (Alibaba Group)

Kun Gai (Alibaba Group)