|
|
|
Table
of Contents
Chairs'
Welcome Message
Stan Matwin (Dalhousie University)
Shipeng Yu (LinkedIn)
Faisal Farooq (IBM)
KDD
2017 Organization
KDD
2017 Sponsors and Supporters |
| |
|
| |
KDD
2017 Keynote Talks
What's
Fair? (Page 1)
Cynthia Dwork (Microsoft Research & Harvard University)
The
Future of Data Integration (Page
3)
Renée J. Miller (University of Toronto)
Three
Principles of Data Science: Predictability, Stability and Computability (Page
5)
Bin Yu (University of California, Berkeley) |
| |
KDD
2017 Applied Invited Talks
Foreword
to the Applied Data Science - Invited Talks Track at KDD-2017 (Page
7)
Usama M. Fayyad (Open Insights)
Evangelos Simoudis (Synapse Partners)
Ashok Srivastava (Verizon)
More
than the Sum of its Parts: Building Domino Data Lab (Page
9)
Eduardo Ariño de la Rubia (Domino Data Lab)
Mining
Big Data in NeuroGenetics to Understand Muscular Dystrophy (Page
11)
Andy Berglund (University of Florida)
Industrial
Machine Learning (Page
13)
Josh Bloom (GE)
Behavior
Informatics to Discover Behavior Insight for Active and Tailored Client
Management (Page
15)
Longbing Cao (University of Technology Sydney) |
| |
It
Takes More than Math and Engineering to Hit the Bullseye with Data (Page
17)
Paritosh Desai (Target)
Planning
and Learning under Uncertainty: Theory and Practice (Page
19)
Jonathan P. How (Massachusetts Institute of Technology)
Big
Data in Climate: Opportunities and Challenges for Machine Learning (Page
21)
Anuj Karpatne (University of Minnesota)
Vipin Kumar (University of Minnesota)
Addressing
Challenges with Big Data for Media Measurement (Page
23)
Mainak Mazumdar (Nielsen)
Machine
Learning Software in Practice: Quo Vadis? (Page
25)
Szilárd Pafka (Epoch)
Designing
AI at Scale to Power Everyday Life (Page
27)
Rajesh Parekh (Facebook)
Spaceborne
Data Enters the Mainstream (Page
29)
David Potere (Tellus Laboratories) |
| |
KDD
2017 Panels
Benchmarks
and Process Management in Data Science: Will We Ever Get Over the Mess? (Page
31)
Usama M. Fayyad (Open Insights)
Arno Candel (H2O.ai, Inc.)
Eduardo Ariño de la Rubia (Domino Data Lab)
Szilárd Pafka (Epoch)
Anthony Chong (IKASI)
Jeong-Yoon Lee (Microsoft)
The
Future of Artificially Intelligent Assistants (Page
33)
Muthu Muthukrishnan (Rutgers University)
Andrew Tomkins (Google)
Larry Heck (Google)
Alborz Geramifard (Amazon)
Deepak Agarwal (LinkedIn) |
| |
KDD
2017 Research Papers (Oral Papers)
Learning
Certifiably Optimal Rule Lists (Page
35)
Elaine Angelino (University of California, Berkeley)
Nicholas Larus-Stone (Harvard University)
Daniel Alabi (Harvard University)
Margo Seltzer (Harvard University)
Cynthia Rudin (Duke University)
Improved
Degree Bounds and Full Spectrum Power Laws in Preferential Attachment
Networks (Page 45)
Chen Avin (Ben Gurion University of the Negev)
Zvi Lotker (Ben Gurion University of the Negev)
Yinon Nahum (Weizmann Institute of Science)
David Peleg (Weizmann Institute of Science)
Unsupervised
Network Discovery for Brain Imaging Data (Page
55)
Zilong Bai (University of California, Davis)
Peter Walker (Naval Medical Research Center)
Anna Tschiffely (Naval Medical Research Center)
Fei Wang (Cornell University)
Ian Davidson (University of California, Davis)
Patient
Subtyping via Time-Aware LSTM Networks (Page
65)
Inci M. Baytas (Michigan State University)
Cao Xiao (IBM T. J. Watson Research Center)
Xi Zhang (Cornell University)
Fei Wang (Cornell University)
Anil K. Jain (Michigan State University)
Jiayu Zhou (Michigan State University) |
| |
Robust
Top-k Multiclass SVM for Visual Category Recognition (Page
75)
Xiaojun Chang (Carnegie Mellon University)
Yao-Liang Yu (University of Waterloo)
Yi Yang (University of Technology Sydney)
KATE:
K-Competitive Autoencoder for Text (Page
85)
Yu Chen (Rensselaer Polytechnic Institute)
Mohammed J. Zaki (Rensselaer Polytechnic Institute)
A
Minimal Variance Estimator for the Cardinality of Big Data Set Intersection (Page
95)
Reuven Cohen (Technion)
Liran Katzir (Technion)
Aviv Yehezkel (Technion)
HyperLogLog
Hyperextended: Sketches for Concave Sublinear Frequency Statistics (Page
105)
Edith Cohen (Google Research)
Fast
Enumeration of Large k-Plexes (Page
115)
Alessio Conte (University of Pisa)
Donatella Firmani (Roma Tre University)
Caterina Mordente (Be Think Solve Execute)
Maurizio Patrignani (Roma Tre University)
Riccardo Torlone (Roma Tre University) |
| |
Matrix
Profile V: A Generic Technique to Incorporate Domain Knowledge into Motif
Discovery (Page 125)
Hoang Anh Dau (University of California Riverside)
Eamonn Keogh (University of California Riverside)
metapath2vec:
Scalable Representation Learning for Heterogeneous Networks (Page
135)
Yuxiao Dong (Microsoft Research & University of Notre Dame)
Nitesh V. Chawla (University of Notre Dame)
Ananthram Swami (Army Research Laboratory)
Ego-Splitting
Framework: from Non-Overlapping to Overlapping Clusters (Page
145)
Alessandro Epasto (Google Research)
Silvio Lattanzi (Google Research Zurich)
Renato Paes Leme (Google Research)
Contextual
Motifs: Increasing the Utility of Motifs using Contextual Data (Page
155)
Ian Fox (University of Michigan)
Lynn Ang (University of Michigan)
Mamta Jaiswal (University of Michigan)
Rodica Pop-Busui (University of Michigan)
Jenna Wiens (University of Michigan)
Unsupervised
P2P Rental Recommendations via Integer Programming (Page
165)
Yanjie Fu (Missouri University of Science and Technology)
Guannan Liu (Beihang University)
Mingfei Teng (Rutgers University)
Charu Aggarwal (IBM T. J. Watson Research Center) |
| |
The
Co-Evolution Model for Social Network Evolving and Opinion Migration (Page
175)
Yupeng Gu (University of California, Los Angeles)
Yizhou Sun (University of California, Los Angeles)
Jianxi Gao (Northeastern University)
Groups-Keeping
Solution Path Algorithm for Sparse Regression with Automatic Feature Grouping (Page
185)
Bin Gu (University of Texas at Arlington)
Guodong Liu (University of Texas at Arlington)
Heng Huang (University of Texas at Arlington)
Clustering
Individual Transactional Data for Masses of Users (Page
195)
Riccardo Guidotti (ISTI-CNR & University of Pisa)
Anna Monreale (University of Pisa)
Mirco Nanni (ISTI-CNR)
Fosca Giannotti (ISTI-CNR)
Dino Pedreschi (University of Pisa)
Network
Inference via the Time-Varying Graphical Lasso (Page
205)
David Hallac (Stanford University)
Youngsuk Park (Stanford University)
Stephen Boyd (Stanford University)
Jure Leskovec (Stanford University)
Toeplitz
Inverse Covariance-Based Clustering of Multivariate Time Series Data (Page
215)
David Hallac (Stanford University)
Sagar Vare (Stanford University)
Stephen Boyd (Stanford University)
Jure Leskovec (Stanford University) |
| |
Efficient
Correlated Topic Modeling with Topic Embedding (Page
225)
Junxian He (Carnegie Mellon University & Shanghai Jiao Tong University)
Zhiting Hu (Carnegie Mellon University & Petuum Inc.)
Taylor Berg-Kirkpatrick (Carnegie Mellon University)
Ying Huang (Shanghai Jiao Tong University)
Eric P. Xing (Carnegie Mellon University & Petuum Inc.)
Accelerating
Innovation Through Analogy Mining (Page
235)
Tom Hope (Hebrew University of Jerusalem)
Joel Chan (Carnegie Mellon University)
Aniket Kittur (Carnegie Mellon University)
Dafna Shahaf (Hebrew University of Jerusalem)
Communication-Efficient
Distributed Block Minimization for Nonlinear Kernel Machines (Page
245)
Cho-Jui Hsieh (University of California, Davis)
Si Si (Google Inc. & Google Research)
Inderjit S. Dhillon (University of Texas at Austin)
A
Hierarchical Algorithm for Extreme Clustering (Page
255)
Ari Kobren (University of Massachusetts, Amherst)
Nicholas Monath (University of Massachusetts, Amherst)
Akshay Krishnamurthy (University of Massachusetts, Amherst)
Andrew McCallum (University of Massachusetts, Amherst) |
| |
Estimating
Treatment Effect in the Wild via Differentiated Confounder Balancing (Page
265)
Kun Kuang (Tsinghua University)
Peng Cui (Tsinghua University)
Bo Li (Tsinghua Univeristy)
Meng Jiang (University of Notre Dame)
Shiqiang Yang (Tsinghua University)
The
Selective Labels Problem: Evaluating Algorithmic Predictions in the Presence
of Unobservables (Page
275)
Himabindu Lakkaraju (Stanford University)
Jon Kleinberg (Cornell University)
Jure Leskovec (Stanford University)
Jens Ludwig (University of Chicago)
Sendhil Mullainathan (Harvard University)
Constructivism
Learning: A Learning Paradigm for Transparent Predictive Analytics (Page
285)
Xiaoli Li (University of Kansas)
Jun Huan (University of Kansas)
Is
the Whole Greater Than the Sum of Its Parts? (Page
295)
Liangyue Li (Arizona State University)
Hanghang Tong (Arizona State University)
Yong Wang (Hong Kong University of Science and Technology)
Conglei Shi (IBM Research)
Nan Cao (Tongji University)
Norbou Buchler (US Army Research Laboratory) |
| |
Collaborative
Variational Autoencoder for Recommender Systems (Page
305)
Xiaopeng Li (Hong Kong University of Science and Technology)
James She (Hong Kong University of Science and Technology)
Linearized
GMM Kernels and Normalized Random Fourier Features (Page
315)
Ping Li (Rutgers University)
Discrete
Content-aware Matrix Factorization (Page
325)
Defu Lian (University of Electronic Science and Technology of China)
Rui Liu (University of Electronic Science and Technology of China)
Yong Ge (University of Arizona)
Kai Zheng (University of Electronic Science and Technology of China)
Xing Xie (Microsoft Research)
Longbing Cao (University of Technology Sydney)
Effective
and Real-time In-App Activity Analysis in Encrypted Internet Traffic Streams (Page
335)
Junming Liu (Rutgers University)
Yanjie Fu (Missouri University of Science and Technology)
Jingci Ming (Rutgers University)
Yong Ren (Futurewei Tech. Inc)
Leilei Sun (Tsinghua University)
Hui Xiong (Rutgers University) |
| |
Functional
Annotation of Human Protein Coding Isoforms via Non-convex Multi-Instance
Learning (Page 345)
Tingjin Luo (National University of Defense Technology)
Weizhong Zhang (Zhejiang University)
Shang Qiu (University of Michigan)
Yang Yang (Beihang University)
Dongyun Yi (National University of Defense Technology)
Guangtao Wang (University of Michigan)
Jieping Ye (University of Michigan)
Jie Wang (University of Michigan)
Discovering
Reliable Approximate Functional Dependencies (Page
355)
Panagiotis Mandros (Max Planck Institute for Informatics & Saarland
University)
Mario Boley (Max Planck Institute for Informatics & Saarland University)
Jilles Vreeken (Max Planck Institute for Informatics & Saarland University)
Towards
an Optimal Subspace for K-Means (Page
365)
Dominik Mautz (Ludwig-Maximilians-Universität München)
Wei Ye (Ludwig-Maximilians-Universität München)
Claudia Plant (University of Vienna)
Christian Böhm (Ludwig-Maximilians-Universität München)
SPARTan:
Scalable PARAFAC2 for Large & Sparse Data (Page
375)
Ioakeim Perros (Georgia Institute of Technology)
Evangelos E. Papalexakis (University of California, Riverside)
Fei Wang (Weill Cornell Medicine)
Richard Vuduc (Georgia Institute of Technology)
Elizabeth Searles (Children's Healthcare Of Atlanta)
Michael Thompson (Children's Healthcare Of Atlanta)
Jimeng Sun (Georgia Institute of Technology) |
| |
struc2vec:
Learning Node Representations from Structural Identity (Page
385)
Leonardo F. R. Ribeiro (Federal University of Rio de Janeiro)
Pedro H. P. Saverese (Federal University of Rio de Janeiro)
Daniel R. Figueiredo (Federal University of Rio de Janeiro)
Similarity
Forests (Page 395)
Saket Sathe (IBM T. J. Watson Research Center)
Charu C. Aggarwal (IBM T. J. Watson Research Center)
Online
Ranking with Constraints: A Primal-Dual Algorithm and Applications to
Web Traffic-Shaping (Page
405)
Parikshit Shah (Yahoo Research)
Akshay Soni (Yahoo Research)
Troy Chevalier (Yahoo Research)
On
Finding Socially Tenuous Groups for Online Social Networks (Page
415)
Chih-Ya Shen (National Tsing Hua University)
Liang-Hao Huang (Academia Sinica)
De-Nian Yang (Academia Sinica)
Hong-Han Shuai (National Chiao Tung University)
Wang-Chien Lee (The Pennsylvania State University)
Ming-Syan Chen (National Taiwan University)
PReP:
Path-Based Relevance from a Probabilistic Perspective in Heterogeneous
Information Networks (Page
425)
Yu Shi (University of Illinois at Urbana-Champaign)
Po-Wei Chan (University of Illinois at Urbana-Champaign)
Honglei Zhuang (University of Illinois at Urbana-Champaign)
Huan Gui (University of Illinois at Urbana-Champaign)
Jiawei Han (University of Illinois at Urbana-Champaign) |
| |
Multi-Aspect
Streaming Tensor Completion (Page
435)
Qingquan Song (Texas A&M University)
Xiao Huang (Texas A&M University)
Hancheng Ge (Texas A&M University)
James Caverlee (Texas A&M University)
Xia Hu (Texas A&M University & Texas A&M Engineering Experiment Station)
Scalable
and Sustainable Deep Learning via Randomized Hashing (Page
445)
Ryan Spring (Rice University)
Anshumali Shrivastava (Rice University)
AnnexML:
Approximate Nearest Neighbor Search for Extreme Multi-label Classification (Page
455)
Yukihiro Tagami (Yahoo Japan Corporation & Kyoto University)
Interpretable
Predictions of Tree-based Ensembles via Actionable Feature Tweaking (Page
465)
Gabriele Tolomei (Yahoo Research)
Fabrizio Silvestri (Facebook)
Andrew Haines (Yahoo Research)
Mounia Lalmas (Yahoo Research) |
| |
Structural
Deep Brain Network Mining (Page
475)
Shen Wang (University of Illinois at Chicago)
Lifang He (Shenzhen University)
Bokai Cao (University of Illinois at Chicago)
Chun-Ta Lu (University of Illinois at Chicago)
Philip S. Yu (University of Illinois at Chicago)
Ann B. Ragin (Northwestern University)
Randomized
Feature Engineering as a Fast and Accurate Alternative to Kernel Methods (Page
485)
Suhang Wang (Arizona State University)
Charu Aggarwal (IBM T. J. Watson Research Center)
Huan Liu (Arizona State University)
Human
Mobility Synchronization and Trip Purpose Detection with Mixture of Hawkes
Processes (Page 495)
Pengfei Wang (Chinese Academy of Sciences)
Yanjie Fu (Missouri University of Science and Technology)
Guannan Liu (Beihang University)
Wenqing Hu (Missouri University of Science and Technology)
Charu Aggarwal (IBM T. J. Watson Research Center)
FORA:
Simple and Effective Approximate Single-Source Personalized PageRank (Page
505)
Sibo Wang (University of Queensland & Nanyang Technological University)
Renchi Yang (Nanyang Technological University)
Xiaokui Xiao (Nanyang Technological University)
Zhewei Wei (Renmin University of China & Nanyang Technological University)
Yin Yang (Hamad Bin Khalifa University) |
| |
Large-scale
Collaborative Ranking in Near-Linear Time (Page
515)
Liwei Wu (University of California, Davis)
Cho-Jui Hsieh (University of California, Davis)
James Sharpnack (University of California, Davis)
HoORaYs:
High-order Optimization of Rating Distance for Recommender Systems (Page
525)
Jingwei Xu (Nanjing University)
Yuan Yao (Nanjing University)
Hanghang Tong (Arizona State University)
Xianping Tao (Nanjing University)
Jian Lu (Nanjing University)
Collaboratively
Improving Topic Discovery and Word Embeddings by Coordinating Global and
Local Contexts (Page
535)
Guangxu Xun (SUNY at Buffalo)
Yaliang Li (SUNY at Buffalo & Baidu Research Big Data Lab)
Jing Gao (SUNY at Buffalo)
Aidong Zhang (SUNY at Buffalo)
PPDsparse:
A Parallel Primal-Dual Sparse Method for Extreme Classification (Page
545)
Ian E.H. Yen (Carnegie Mellon University)
Xiangru Huang (University of Texas at Austin)
Wei Dai (Carnegie Mellon University & Petuum Inc.)
Pradeep Ravikumar (Carnegie Mellon University)
Inderjit Dhillon (University of Texas at Austin)
Eric Xing (Carnegie Mellon University & Petuum Inc.) |
| |
Local
Higher-Order Graph Clustering (Page
555)
Hao Yin (Stanford University)
Austin R. Benson (Stanford University)
Jure Leskovec (Stanford University)
David F. Gleich (Purdue University)
Long
Short Memory Process: Modeling Growth Dynamics of Microscopic Social Connectivity (Page
565)
Chengxi Zang (Tsinghua University)
Peng Cui (Tsinghua University)
Christos Faloutsos (Carnegie Mellon University)
Wenwu Zhu (Tsinghua University)
Weisfeiler-Lehman
Neural Machine for Link Prediction (Page
575)
Muhan Zhang (Washington University in St. Louis)
Yixin Chen (Washington University in St. Louis)
EmbedJoin:
Efficient Edit Similarity Joins via Embeddings (Page
585)
Haoyu Zhang (Indiana University Bloomington)
Qin Zhang (Indiana University Bloomington)
TrioVecEvent:
Embedding-Based Online Local Event Detection in Geo-Tagged Tweet Streams (Page
595)
Chao Zhang (University of Illinois at Urbana-Champaign)
Liyuan Liu (University of Illinois at Urbana-Champaign)
Dongming Lei (University of Illinois at Urbana-Champaign)
Quan Yuan (University of Illinois at Urbana-Champaign)
Honglei Zhuang (University of Illinois at Urbana-Champaign)
Tim Hanratty (U.S. Army Research Lab)
Jiawei Han (University of Illinois at Urbana-Champaign) |
| |
Graph
Edge Partitioning via Neighborhood Heuristic (Page
605)
Chenzi Zhang (University of Hong Kong & Noah's Ark Lab)
Fan Wei (Stanford University)
Qin Liu (Huawei Noah's Ark Lab & Chinese University of Hong Kong)
Zhihao Gavin Tang (University of Hong Kong)
Zhenguo Li (Huawei Noah's Ark Lab)
Randomization
or Condensation? Linear-Cost Matrix Sketching Via Cascaded Compression
Sampling (Page 615)
Kai Zhang (Temple University)
Chuanren Liu (Drexel University)
Jie Zhang (Fudan University)
Hui Xiong (Rutgers University)
Eric Xing (Carneigie Mellon University)
Jieping Ye (University of Michigan, Ann Arbor)
Tracking
the Dynamics in Crowdfunding (Page
625)
Hongke Zhao (University of Science and Technology of China)
Hefu Zhang (University of Science and Technology of China)
Yong Ge (University of Arizona)
Qi Liu (University of Science and Technology of China)
Enhong Chen (University of Science and Technology of China)
Huayu Li (University of North Carolina at Charlotte)
Le Wu (Hefei University of Technology) |
| |
Meta-Graph
Based Recommendation Fusion over Heterogeneous Information Networks (Page
635)
Huan Zhao (Hong Kong University of Science and Technology)
Quanming Yao (Hong Kong University of Science and Technology)
Jianda Li (Hong Kong University of Science and Technology)
Yangqiu Song (Hong Kong University of Science and Technology)
Dik Lun Lee (Hong Kong University of Science and Technology)
Coresets
for Kernel Regression (Page
645)
Yan Zheng (University of Utah)
Jeff M. Phillips (University of Utah)
A
Local Algorithm for Structure-Preserving Graph Cut (Page
655)
Dawei Zhou (Arizona State University)
Si Zhang (Arizona State University)
Mehmet Yigit Yildirim (Arizona State University)
Scott Alcorn (Early Warnings LLC.)
Hanghang Tong (Arizona State University)
Hasan Davulcu (Arizona State University)
Jingrui He (Arizona State University)
Anomaly
Detection with Robust Deep Autoencoders (Page
665)
Chong Zhou (Worcester Polytechnic Institute)
Randy C. Paffenroth (Worcester Polytechnic Institute) |
| |
KDD
2017 Research Papers (Poster Papers)
Effective
Evaluation Using Logged Bandit Feedback from Multiple Loggers (Page
687)
Aman Agarwal (Cornell University)
Soumya Basu (Cornell University)
Tobias Schnabel (Cornell University)
Thorsten Joachims (Cornell University)
Tripoles:
A New Class of Relationships in Time Series Data (Page
697)
Saurabh Agrawal (University of Minnesota)
Gowtham Atluri (University of Cincinnati)
Anuj Karpatne (University of Minnesota)
William Haltom (University of Minnesota)
Stefan Liess (Univertsity of Minnesota)
Snigdhansu Chatterjee (University of Minnesota)
Vipin Kumar (University of Minnesota)
Post
Processing Recommender Systems for Diversity (Page
707)
Arda Antikacioglu (Carnegie Mellon University)
R Ravi (Carnegie Mellon University)
Aspect
Based Recommendations: Recommending Items with the Most Valuable Aspects
Based on User Reviews (Page
717)
Konstantin Bauman (New York University)
Bing Liu (University of Illinois at Chicago (UIC))
Alexander Tuzhilin (New York University) |
| |
Bolt:
Accelerated Data Mining with Fast Vector Compression (Page
727)
Davis W. Blalock (Massachusetts Institute of Technology)
John V. Guttag (Massachusetts Institute of Technology)
Robust
Spectral Clustering for Noisy Data: Modeling Sparse Corruptions Improves
Latent Embeddings (Page
737)
Aleksandar Bojchevski (Technical University of Munich)
Yves Matkovic (Technical University of Munich)
Stephan Günnemann (Technical University of Munich)
DeepMood:
Modeling Mobile Phone Typing Dynamics for Mood Detection (Page
747)
Bokai Cao (University of Illinois at Chicago)
Lei Zheng (University of Illinois at Chicago)
Chenwei Zhang (University of Illinois at Chicago)
Philip S. Yu (Tsinghua University & University of Illinois at Chicago)
Andrea Piscitello (University of Illinois at Chicago)
John Zulueta (University of Illinois at Chicago)
Olu Ajilore (University of Illinoisat Chicago)
Kelly Ryan (University of Michigan)
Alex D. Leow (University of Illinois at Chicago)
Fast
Newton Hard Thresholding Pursuit for Sparsity Constrained Nonconvex Optimization (Page
757)
Jinghui Chen (University of Virginia)
Quanquan Gu (University of Virginia) |
| |
On
Sampling Strategies for Neural Network-based Collaborative Filtering (Page
767)
Ting Chen (University of California, Los Angeles)
Yizhou Sun (University of California, Los Angeles)
Yue Shi (Yahoo! Research)
Liangjie Hong (Etsy Inc.)
Unsupervised
Feature Selection in Signed Social Networks (Page
777)
Kewei Cheng (Arizona State University)
Jundong Li (Arizona State University)
Huan Liu (Arizona State University)
GRAM:
Graph-based Attention Model for Healthcare Representation Learning (Page
787)
Edward Choi (Georgia Institute of Technology)
Mohammad Taha Bahadori (Georgia Institute of Technology)
Le Song (Georgia Institute of Technology)
Walter F. Stewart (Sutter Health)
Jimeng Sun (Georgia Institute of Technology)
Algorithmic
Decision Making and the Cost of Fairness (Page
797)
Sam Corbett-Davies (Stanford University)
Emma Pierson (Stanford University)
Avi Feller (University of California, Berkeley)
Sharad Goel (Stanford University)
Aziz Huq (University of Chicago) |
| |
Structural
Diversity and Homophily: A Study Across More Than One Hundred Big Networks (Page
807)
Yuxiao Dong (Microsoft Research & University of Notre Dame)
Reid A. Johnson (University of Notre Dame)
Jian Xu (University of Notre Dame)
Nitesh V. Chawla (University of Notre Dame)
Revisiting
Power-law Distributions in Spectra of Real World Networks (Page
817)
Nicole Eikmeier (Purdue University)
David F. Gleich (Purdue University)
REMIX:
Automated Exploration for Interactive Outlier Detection (Page
827)
Yanjie Fu (Missouri University of Science & Technology)
Charu Aggarwal (IBM T. J. Watson Research Center)
Srinivasan Parthasarathy (IBM T. J. Watson Research Center)
Deepak S. Turaga (IBM T. J. Watson Research Center)
Hui Xiong (Rutgers University)
Anarchists,
Unite: Practical Entropy Approximation for Distributed Streams (Page
837)
Moshe Gabel (Technion - Israel Institute of Technology)
Daniel Keren (Haifa University)
Assaf Schuster (Technion - Israel Institute of Technology) |
| |
Recurrent
Poisson Factorization for Temporal Recommendation (Page
847)
Seyed Abbas Hosseini (Sharif University of Technology)
Keivan Alizadeh (Sharif University of Technology)
Ali Khodadadi (Sharif University of Technology)
Ali Arabzadeh (Sharif University of Technology)
Mehrdad Farajtabar (Georgia Institute of Technology)
Hongyuan Zha (Georgia Institute of Technology)
Hamid R. Rabiee (Sharif University of Technology)
SPOT:
Sparse Optimal Transformations for High Dimensional Variable Selection
and Exploratory Regression Analysis (Page
857)
Qiming Huang (Purdue University)
Michael Zhu (Purdue University & Tsinghua University)
Incremental
Dual-memory LSTM in Land Cover Prediction (Page
867)
Xiaowei Jia (University of Minnesota)
Ankush Khandelwal (University of Minnesota)
Guruprasad Nayak (University of Minnesota)
James Gerber (University of Minnesota)
Kimberly Carlson (University of Hawaii Manoa)
Paul West (University of Minnesota)
Vipin Kumar (University of Minnesota)
MetaPAD:
Meta Pattern Discovery from Massive Text Corpora (Page
877)
Meng Jiang (University of Illinois at Urbana-Champaign)
Jingbo Shang (University of Illinois at Urbana-Champaign)
Taylor Cassidy (Army Research Laboratory)
Xiang Ren (University of Illinois at Urbana-Champaign)
Lance M. Kaplan (Army Research Laboratory)
Timothy P. Hanratty (Army Research Laboratory)
Jiawei Han (University of Illinois at Urbana-Champaign) |
| |
Federated
Tensor Factorization for Computational Phenotyping (Page
887)
Yejin Kim (Pohang University of Science and Technology & University
of California, San Diego)
Jimeng Sun (Georgia Institute of Technology)
Hwanjo Yu (Pohang University of Science and Technology)
Xiaoqian Jiang (University of California, San Diego)
Statistical
Emerging Pattern Mining with Multiple Testing Correction (Page
897)
Junpei Komiyama (University of Tokyo)
Masakazu Ishihata (Hokkaido University)
Hiroki Arimura (Hokkaido University)
Takashi Nishibayashi (VOYAGE GROUP, Inc.)
Shin-ichi Minato (Hokkaido University)
Semi-Supervised
Techniques for Mining Learning Outcomes and Prerequisites (Page
907)
Igor Labutov (Carnegie Mellon University)
Yun Huang (University of Pittsburgh)
Peter Brusilovsky (University of Pittsburgh)
Daqing He (University of Pittsburgh)
Prospecting
the Career Development of Talents: A Survival Analysis Perspective (Page
917)
Huayu Li (University of North Carolina at Charlotte & Baidu Talent
Intelligence Center)
Yong Ge (University of Arizona)
Hengshu Zhu (Baidu Talent Intelligence Center)
Hui Xiong (Rutgers University)
Hongke Zhao (University of Sci. and Tech. of China) |
| |
A
Context-aware Attention Network for Interactive Question Answering (Page
927)
Huayu Li (University of North Carolina, Charlotte)
Martin Renqiang Min (NEC Laboratories America)
Yong Ge (University of Arizona)
Asim Kadav (NEC Laboratories America)
Distributed
Multi-Task Relationship Learning (Page
937)
Sulin Liu (Nanyang Technological University, Singapore)
Sinno Jialin Pan (Nanyang Technological University, Singapore)
Qirong Ho (Petuum, Inc.)
Point-of-Interest
Demand Modeling with Human Mobility Patterns (Page
947)
Yanchi Liu (Rutgers University)
Chuanren Liu (Drexel University)
Xinjiang Lu (Northwestern Polytechnical University)
Mingfei Teng (Rutgers University)
Hengshu Zhu (Baidu Talent Intelligence Center)
Hui Xiong (Rutgers University)
Functional
Zone Based Hierarchical Demand Prediction For Bike System Expansion (Page
957)
Junming Liu (Rutgers University)
Leilei Sun (Tsinghua University)
Qiao Li (Rutgers University)
Jingci Ming (Rutgers University)
Yanchi Liu (Rutgers University)
Hui Xiong (Rutgers University) |
| |
Unsupervised
Discovery of Drug Side-Effects from Heterogeneous Data Sources (Page
967)
Fenglong Ma (SUNY Buffalo)
Chuishi Meng (SUNY Buffalo)
Houping Xiao (SUNY Buffalo)
Qi Li (SUNY Buffalo)
Jing Gao (SUNY Buffalo)
Lu Su (SUNY Buffalo)
Aidong Zhang (SUNY Buffalo)
Let's
See Your Digits: Anomalous-State Detection using Benford's Law (Page
977)
Samuel Maurus (Technical University of Munich)
Claudia Plant (University of Vienna)
Mixture
Factorized Ornstein-Uhlenbeck Processes for Time-Series Forecasting (Page
987)
Guo-Jun Qi (University of Central Florida)
Jiliang Tang (Michigan State University)
Jingdong Wang (Microsoft Research Asia and Hefei University of Technology)
Jiebo Luo (University of Rochester)
Automatic
Synonym Discovery with Knowledge Bases (Page
997)
Meng Qu (University of Illinois at Urbana-Champaign)
Xiang Ren (University of Illinois at Urbana-Champaign)
Jiawei Han (University of Illinois at Urbana-Champaign) |
| |
An
Alternative to NCD for Large Sequences, Lempel-Ziv Jaccard Distance (Page
1007)
Edward Raff (Laboratory for Physical Sciences)
Charles Nicholas (University of Maryland, Baltimore County)
Inferring
the Strength of Social Ties: A Community-Driven Approach (Page
1017)
Polina Rozenshtein (Aalto University)
Nikolaj Tatti (Aalto University)
Aristides Gionis (Aalto University)
Detecting
Network Effects: Randomizing Over Randomized Experiments (Page
1027)
Martin Saveski (Massachusetts Institute of Technology)
Jean Pouget-Abadie (Harvard University)
Guillaume Saint-Jacques (Massachusetts Institute of Technology)
Weitao Duan (LinkedIn)
Souvik Ghosh (LinkedIn)
Ya Xu (LinkedIn)
Edoardo M. Airoldi (Harvard University)
When
is a Network a Network? Multi-Order Graphical Model Selection in Pathways
and Temporal Networks (Page
1037)
Ingo Scholtes (ETH Zürich) |
| |
ReasoNet:
Learning to Stop Reading in Machine Comprehension (Page
1047)
Yelong Shen (Microsoft Research)
Po-Sen Huang (Microsoft Research)
Jianfeng Gao (Microsoft Research)
Weizhu Chen (Microsoft Research)
DenseAlert:
Incremental Dense-Subtensor Detection in Tensor Streams (Page
1057)
Kijung Shin (Carnegie Mellon University)
Bryan Hooi (Carnegie Mellon University)
Jisu Kim (Carnegie Mellon University)
Christos Faloutsos (Carnegie Mellon University)
Anomaly
Detection in Streams with Extreme Value Theory (Page
1067)
Alban Siffer (IRISA)
Pierre-Alain Fouque (IRISA)
Alexandre Termier (IRISA)
Christine Largouet (IRISA)
Relay-Linking
Models for Prominence and Obsolescence in Evolving Networks (Page
1077)
Mayank Singh (IIT Kharagpur)
Rajdeep Sarkar (IIT Kharagpur)
Pawan Goyal (IIT Kharagpur)
Animesh Mukherjee (IIT Kharagpur)
Soumen Chakrabarti (IIT Bombay) |
| |
PAMAE:
Parallel k-Medoids Clustering with High Accuracy and Efficiency (Page
1087)
Hwanjun Song (Korea Advanced Institute of Science and Technology)
Jae-Gil Lee (Korea Advanced Institute of Science and Technology)
Wook-Shin Han (POSTECH)
Sparse
Compositional Local Metric Learning (Page
1097)
Joseph St.Amand (University of Kansas)
Jun Huan (University of Kansas)
End-to-end
Learning for Short Text Expansion (Page
1105)
Jian Tang (University of Michigan)
Yue Wang (University of Michigan)
Kai Zheng (University of California, Irvine)
Qiaozhu Mei (University of Michigan)
Construction
of Directed 2K Graphs (Page
1115)
Bálint Tillman (University of California, Irvine)
Athina Markopoulou (University of California, Irvine)
Carter T. Butts (University of California, Irvine)
Minas Gjoka (University of California, Irvine)
Optimized
Risk Scores (Page
1125)
Berk Ustun (Massachusetts Institute of Technology)
Cynthia Rudin (Duke University) |
| |
A
Location-Sentiment-Aware Recommender System for Both Home-Town and Out-of-Town
Users (Page 1135)
Hao Wang (Qihoo 360 Search Lab)
Yanmei Fu (Chinese Academy of Sciences & University of Chinese Academy
of Sciences)
Qinyong Wang (University of Queensland)
Hongzhi Yin (University of Queensland)
Changying Du (Chinese Academy of Sciences)
Hui Xiong (Rutgers University)
Adversary
Resistant Deep Neural Networks with an Application to Malware Detection (Page
1145)
Qinglong Wang (Pennsylvania State University & McGill University)
Wenbo Guo (The Pennsylvania State University)
Kaixuan Zhang (The Pennsylvania State University)
Alexander G. Ororbia II (The Pennsylvania State University)
Xinyu Xing (The Pennsylvania State University)
Xue Liu (McGill University)
C. Lee Giles (The Pennsylvania State University)
Multi-Modality
Disease Modeling via Collective Deep Matrix Factorization (Page
1155)
Qi Wang (Michigan State University)
Mengying Sun (Michigan State University)
Liang Zhan (University of Wisconsin-Stout)
Paul Thompson (University of Southern California)
Shuiwang Ji (Washington State University)
Jiayu Zhou (Michigan State University) |
| |
Decomposed
Normalized Maximum Likelihood Codelength Criterion for Selecting Hierarchical
Latent Variable Models (Page
1165)
Tianyi Wu (University of Tokyo)
Shinya Sugawara (University of Tokyo)
Kenji Yamanishi (University of Tokyo)
Structural
Event Detection from Log Messages (Page
1175)
Fei Wu (The Pennsylvania State University)
Pranay Anchuri (NEC Laboratories America)
Zhenhui Li (The Pennsylvania State University)
Retrospective
Higher-Order Markov Processes for User Trails (Page
1185)
Tao Wu (Purdue University)
David F. Gleich (Purdue University)
Privacy-Preserving
Distributed Multi-Task Learning with Asynchronous Updates (Page
1195)
Liyang Xie (Michigan State University)
Inci M. Baytas (Michigan State University)
Kaixiang Lin (Michigan State University)
Jiayu Zhou (Michigan State University) |
| |
Evaluating
U.S. Electoral Representation with a Joint Statistical Model of Congressional
Roll-Calls, Legislative Text, and Voter Registration Data (Page
1205)
Zhengming Xing (Criteo Labs)
Sunshine Hillygus (Duke University)
Lawrence Carin (Duke University)
Convex
Factorization Machine for Toxicogenomics Prediction (Page
1215)
Makoto Yamada (RIKEN AIP, JST PRESTO)
Wenzhao Lian (Vicarious)
Amit Goyal (Yahoo Research)
Jianhui Chen (Microsoft)
Kishan Wimalawarne (Kyoto University)
Suleiman A. Khan (University of Helsinki)
Samuel Kaski (Aalto University)
Hiroshi Mamitsuka (Kyoto University & Aalto University)
Yi Chang (Huawei Research America)
Distributed
Local Outlier Detection in Big Data (Page
1225)
Yizhou Yan (Worcester Polytechnic Institute)
Lei Cao (Massachusetts Institute of Technology)
Caitlin Kulhman (Worcester Polytechnic Institute)
Elke Rundensteiner (Worcester Polytechnic Institute)
Scalable
Top-n Local Outlier Detection (Page
1235)
Yizhou Yan (Worcester Polytechnic Institute)
Lei Cao (Massachusetts Institute of Technology)
Elke A. Rundensteiner (Worcester Polytechnic Institute) |
| |
Bridging
Collaborative Filtering and Semi-Supervised Learning: A Neural Approach
for POI Recommendation (Page
1245)
Carl Yang (University of Illinois, Urbana Champaign)
Lanxiao Bai (University of Illinois, Urbana Champaign)
Chao Zhang (University of Illinois, Urbana Champaign)
Quan Yuan (University of Illinois, Urbana Champaign)
Jiawei Han (University of Illinois, Urbana Champaign)
Multi-task
Function-on-function Regression with Co-grouping Structured Sparsity (Page
1255)
Pei Yang (South China University of Technology & Arizona State University)
Qi Tan (South China Normal University)
Jingrui He (Arizona State University)
Learning
from Labeled and Unlabeled Vertices in Networks (Page
1265)
Wei Ye (Ludwig-Maximilians-Universität München)
Linfei Zhou (Ludwig-Maximilians-Universität München)
Dominik Mautz (Ludwig-Maximilians-Universität München)
Claudia Plant (University of Vienna)
Christian Böhm (Ludwig-Maximilians-Universität München) |
| |
Small
Batch or Large Batch? Gaussian Walk with Rebound Can Teach (Page
1275)
Peifeng Yin (IBM Almaden Research Center)
Ping Luo (Chinese Academy of Sciences & University of Chinese Academy
of Sciences)
Taiga Nakamura (IBM Almaden Research Center)
Learning
from Multiple Teacher Networks (Page
1285)
Shan You (Peking University)
Chang Xu (University of Sydney)
Chao Xu (Peking University)
Dacheng Tao (University of Sydney)
A
Temporally Heterogeneous Survival Framework with Application to Social
Behavior Dynamics (Page
1295)
Linyun Yu (Tsinghua University)
Peng Cui (Tsinghua University)
Chaoming Song (University of Miami)
Tianyang Zhang (Tsinghua University)
Shiqiang Yang (Tsinghua University)
Inductive
Semi-supervised Multi-Label Learning with Co-Training (Page
1305)
Wang Zhan (Southeast University & Ministry of Education)
Min-Ling Zhang (Southeast University & Collaborative Innovation Center
of Wireless Communications Technology) |
| |
LEAP:
Learning to Prescribe Effective and Safe Treatment Combinations for Multimorbidity (Page
1315)
Yutao Zhang (Tsinghua University)
Robert Chen (Georgia Institute of Technology)
Jie Tang (Tsinghua University)
Walter F. Stewart (Sutter Health)
Jimeng Sun (Georgia Institute of Technology)
Visualizing
Attributed Graphs via Terrain Metaphor (Page
1325)
Yang Zhang (Ohio State University)
Yusu Wang (Ohio State University)
Srinivasan Parthasarathy (Ohio State University)
Achieving
Non-Discrimination in Data Release (Page
1335)
Lu Zhang (University of Arkansas)
Yongkai Wu (University of Arkansas)
Xintao Wu (University of Arkansas) |
| |
KDD
2017 Applied Data Science Papers (Oral Papers)
Using
Convolutional Networks and Satellite Imagery to Identify Patterns in Urban
Environments at a Large Scale (Page
1357)
Adrian Albert (Massachusetts Institute of Technology)
Jasleen Kaur (Philips Lighting Research)
Marta C. Gonzalez (Massachusetts Institute of Technology)
Luck
is Hard to Beat: The Difficulty of Sports Prediction (Page
1367)
Raquel Y S Aoki (Universidade Federal de Minas Gerais)
Renato M. Assuncao (Universidade Federal de Minas Gerais)
Pedro O S Vaz de Melo (Universidade Federal de Minas Gerais)
Planning
Bike Lanes based on Sharing-Bikes' Trajectories (Page
1377)
Jie Bao (Microsoft Research)
Tianfu He (Harbin Institution of Technology)
Sijie Ruan (Xidian University)
Yanhua Li (WPI)
Yu Zheng (Microsoft Research) |
| |
TFX:
A TensorFlow-Based Production-Scale Machine Learning Platform (Page
1387)
Denis Baylor (Google Inc.)
Eric Breck (Google Inc.)
Heng-Tze Cheng (Google Inc.)
Noah Fiedel (Google Inc.)
Chuan Yu Foo (Google Inc.)
Zakaria Haque (Google Inc.)
Salem Haykal (Google Inc.)
Mustafa Ispir (Google Inc.)
Vihan Jain (Google Inc.)
Levent Koc (Google Inc.)
Chiu Yuen Koo (Google Inc.)
Lukasz Lew (Google Inc.)
Clemens Mewald (Google Inc.)
Akshay Naresh Modi (Google Inc.)
Neoklis Polyzotis (Google Inc.)
Sukriti Ramesh (Google Inc.)
Sudip Roy (Google Inc.)
Steven Euijong Whang (Google Inc.)
Martin Wicke (Google Inc.)
Jarek Wilkiewicz (Google Inc.)
Xin Zhang (Google Inc.)
Martin Zinkevich (Google Inc.)
LiJAR:
A System for Job Application Redistribution towards Efficient Career Marketplace (Page
1397)
Fedor Borisyuk (LinkedIn Corporation)
Liang Zhang (LinkedIn Corporation)
Krishnaram Kenthapadi (LinkedIn Corporation) |
| |
A
Data Science Approach to Understanding Residential Water Contamination
in Flint (Page 1407)
Alex Chojnacki (University of Michigan)
Chengyu Dai (University of Michigan)
Arya Farahi (University of Michigan)
Guangsha Shi (University of Michigan)
Jared Webb (Brigham Young University)
Daniel T. Zhang (University of Michigan)
Jacob Abernethy (University of Michigan)
Eric Schwartz (University of Michigan)
Estimation
of Recent Ancestral Origins of Individuals on a Large Scale (Page
1417)
Ross E. Curtis (AncestryDNA)
Ahna R. Girshick (AncestryDNA)
A
Dirty Dozen: Twelve Common Metric Interpretation Pitfalls in Online Controlled
Experiments (Page
1427)
Pavel Dmitriev (Microsoft Corporation)
Somit Gupta (Microsoft Corporation)
Dong Woo Kim (Microsoft Corporation)
Garnet Vaz (Microsoft Corporation)
A
Century of Science: Globalization of Scientific Collaborations, Citations,
and Innovations (Page
1437)
Yuxiao Dong (Microsoft Research)
Hao Ma (Microsoft Research)
Zhihong Shen (Microsoft Research)
Kuansan Wang (Microsoft Research) |
| |
FIRST:
Fast Interactive Attributed Subgraph Matching (Page
1447)
Boxin Du (Arizona State University)
Si Zhang (Arizona State University)
Nan Cao (Tongji University)
Hanghang Tong (Arizona State University)
Prognosis
and Diagnosis of Parkinson's Disease Using Multi-Task Learning (Page
1457)
Saba Emrani (SAS Institute Inc.)
Anya McGuirk (SAS Institute Inc.)
Wei Xiao (SAS Institute Inc.)
A
Data Mining Framework for Valuing Large Portfolios of Variable Annuities (Page
1467)
Guojun Gan (University of Connecticut)
Jimmy Xiangji Huang (York University)
GELL:
Automatic Extraction of Epidemiological Line Lists from Open Sources (Page
1477)
Saurav Ghosh (Virginia Tech)
Prithwish Chakraborty (Virginia Tech)
Bryan L. Lewis (Virginia Tech)
Maimuna S. Majumder (Massachusetts Institute of Technology & Boston
Children's Hospital)
Emily Cohn (Boston Children's Hospital)
John S. Brownstein (Boston Children's Hospital)
Madhav V. Marathe (Virginia Tech)
Naren Ramakrishnan (Virginia Tech) |
| |
Google
Vizier: A Service for Black-Box Optimization (Page
1487)
Daniel Golovin (Google Research)
Benjamin Solnik (Google Research)
Subhodeep Moitra (Google Research)
Greg Kochanski (Google Research)
John Karro (Google Research)
D. Sculley (Google Research)
Predicting
Clinical Outcomes Across Changing Electronic Health Record Systems (Page
1497)
Jen J. Gong (Massachusetts Institute of Technology)
Tristan Naumann (Massachusetts Institute of Technology)
Peter Szolovits (Massachusetts Institute of Technology)
John V. Guttag (Massachusetts Institute of Technology)
HinDroid:
An Intelligent Android Malware Detection System Based on Structured Heterogeneous
Information Network (Page
1507)
Shifu Hou (West Virginia University)
Yanfang Ye (West Virginia University)
Yangqiu Song (HKUST)
Melih Abdulhayoglu (Comodo Security Solutions, Inc.)
Peeking
at A/B Tests: Why it matters, and what to do about it (Page
1517)
Ramesh Johari (Stanford University)
Pete Koomen (Optimizely, Inc.)
Leonid Pekelis (Optimizely, Inc.)
David Walsh (Stanford University) |
| |
PNP:
Fast Path Ensemble Method for Movie Design (Page
1527)
Danai Koutra (University of Michigan)
Abhilash Dighe (University of Michigan)
Smriti Bhagat (Facebook & Technicolor)
Udi Weinsberg (Facebook & Technicolor)
Stratis Ioannidis (Northeastern University)
Christos Faloutsos (Carnegie Mellon University)
Jean Bolot (Technicolor)
Pharmacovigilance
via Baseline Regularization with Large-Scale Longitudinal Observational
Data (Page 1537)
Zhaobin Kuang (University of Wisconsin-Madison)
Peggy Peissig (Marshfield Clinic)
Vitor Santos Costa (Universidade do Porto)
Richard Maclin (University of Minnesota-Duluth)
David Page (University of Wisconsin-Madison)
FLAP:
An End-to-End Event Log Analysis Platform for System Management (Page
1547)
Tao Li (Nanjing University of Posts and Telecommunications)
Yexi Jiang (Florida International University)
Chunqiu Zeng (Florida International University)
Bin Xia (Nanjing University of Posts and Telecommunications)
Zheng Liu (Nanjing University of Posts and Telecommunications)
Wubai Zhou (Florida International University)
Xiaolong Zhu (Florida International University)
Wentao Wang (Florida International University)
Liang Zhang (Huawei Nanjing Research and Development Center)
Jun Wu (Huawei Nanjing Research and Development Center)
Li Xue (Huawei Nanjing Research and Development Center)
Dewei Bao (Huawei Nanjing Research and Development Center) |
| |
Cascade
Ranking for Operational E-commerce Search (Page
1557)
Shichen Liu (Alibaba Group)
Fei Xiao (Alibaba Group)
Wenwu Ou (Alibaba Group)
Luo Si (Alibaba Group)
Developing
a Comprehensive Framework for Multimodal Feature Extraction (Page
1567)
Quinten McNamara (University of Texas at Austin)
Alejandro De La Vega (University of Texas at Austin)
Tal Yarkoni (University of Texas at Austin)
Deep
Choice Model Using Pointer Networks for Airline Itinerary Prediction (Page
1575)
Alejandro Mottini (Amadeus SAS)
Rodrigo Acuna-Agost (Amadeus SAS)
Compass:
Spatio Temporal Sentiment Analysis of US Election: What Twitter Says! (Page
1585)
Debjyoti Paul (University of Utah)
Feifei Li (University of Utah)
Murali Krishna Teja (University of Utah)
Xin Yu (University of Utah)
Richie Frost (University of Utah) |
| |
Backpage
and Bitcoin: Uncovering Human Traffickers (Page
1595)
Rebecca S. Portnoff (University of California, Berkeley)
Danny Yuxing Huang (University of California, San Diego)
Periwinkle Doerfler (New York University)
Sadia Afroz (ICSI)
Damon McCoy (New York University)
"Not
All Passes Are Created Equal:" Objectively Measuring The Risk and Reward
of Passes in Soccer from Tracking Data (Page
1605)
Paul Power (STATS)
Hector Ruiz (STATS)
Xinyu Wei (STATS)
Patrick Lucey (STATS)
MARAS:
Signaling Multi-Drug Adverse Reactions (Page
1615)
Xiao Qin (Worcester Polytechnic Institute)
Tabassum Kakar (Worcester Polytechnic Institute)
Susmitha Wunnava (Worcester Polytechnic Institute)
Elke A. Rundensteiner (Worcester Polytechnic Institute)
Lei Cao (Massachusetts Institute of Technology)
A
Practical Exploration System for Search Advertising (Page
1625)
Parikshit Shah (Yahoo Research)
Ming Yang (Yahoo)
Sachidanand Alle (Yahoo)
Adwait Ratnaparkhi (Yahoo Research)
Ben Shahshahani (Yahoo Research)
Rohit Chandra (Yahoo) |
| |
MOLIERE:
Automatic Biomedical Hypothesis Generation System (Page
1633)
Justin Sybrandt (Clemson University)
Michael Shtutman (University of South Carolina)
Ilya Safro (Clemson University)
Quick
Access: Building a Smart Experience for Google Drive (Page
1643)
Sandeep Tata (Google USA)
Alexandrin Popescul (Google USA)
Marc Najork (Google USA)
Mike Colagrosso (Google USA)
Julian Gibbons (Google Australia)
Alan Green (Google Australia)
Alexandre Mah (Google Australia)
Michael Smith (Google Australia)
Divanshu Garg (Google Australia)
Cayden Meyer (Google Australia)
Reuben Kan (Google Australia)
The
Simpler The Better: A Unified Approach to Predicting Original Taxi Demands
based on Large-Scale Online Platforms (Page
1653)
Yongxin Tong (Beihang University)
Yuqiang Chen (4Paradigm Inc.)
Zimu Zhou (ETH Zurich)
Lei Chen (Hong Kong University of Science and Technology)
Jie Wang (Didi Research Institute)
Qiang Yang (4Paradigm Inc. & Hong Kong University of Science and Technology)
Jieping Ye (Didi Research Institute)
Weifeng Lv (Beihang University) |
| |
DeepSD:
Generating High Resolution Climate Change Projections through Single Image
Super-Resolution (Page
1663)
Thomas Vandal (Northeastern University)
Evan Kodra (risQ Inc.)
Sangram Ganguly (Bay Area Environmental Research Institute / NASA Ames
Research Center)
Andrew Michaelis (University Corporation, Monterey Bay)
Ramakrishna Nemani (NASA Advanced Supercomputing Division / NASA Ames
Research Center)
Auroop R. Ganguly (Northeastern University)
No
Longer Sleeping with a Bomb: A Duet System for Protecting Urban Safety
from Dangerous Goods (Page
1673)
Jingyuan Wang (Beihang University)
Chao Chen (Beihang University)
Junjie Wu (Beihang University)
Zhang Xiong (Beihang University)
A
Quasi-experimental Estimate of the Impact of P2P Transportation Platforms
on Urban Consumer Patterns (Page
1683)
Zhe Zhang (Carnegie Mellon University)
Beibei Li (Carnegie Mellon University) |
| |
KunPeng:
Parameter Server based Distributed Learning Systems and Its Applications
in Alibaba and Ant Financial (Page
1693)
Jun Zhou (Ant Financial Services Group)
Xiaolong Li (Ant Financial Services Group)
Peilin Zhao (Ant Financial Services Group)
Chaochao Chen (Ant Financial Services Group)
Longfei Li (Ant Financial Services Group)
Xinxing Yang (Ant Financial Services Group)
Qing Cui (Alibaba Cloud)
Jin Yu (Alibaba Cloud)
Xu Chen (Alibaba Cloud)
Yi Ding (Alibaba Cloud)
Yuan Alan Qi (Ant Financial Services Group)
Deep
Embedding Forest: Forest-based Serving with Deep Embedding Features (Page
1703)
Jie Zhu (Microsoft Corporation)
Ying Shan (Microsoft Corporation)
JC Mao (Microsoft Corporation)
Dong Yu (Microsoft Corporation)
Holakou Rahmanian (University of California, Santa Cruz)
Yi Zhang (Microsoft Corporation) |
| |
KDD
2017 Applied Data Science Papers (Poster Papers)
A
Practical Algorithm for Solving the Incoherence Problem of Topic Models
In Industrial Applications (Page
1713)
Amr Ahmed (Google Research)
James Long (Google Research)
Daniel Silva (Google Research)
Yuan Wang (Google Research)
Machine
Learning for Encrypted Malware Traffic Classification: Accounting for
Noisy Labels and Non-Stationarity (Page
1723)
Blake Anderson (Cisco Systems, Inc.)
David McGrew (Cisco Systems, Inc.)
Extremely
Fast Decision Tree Mining for Evolving Data Streams (Page
1733)
Albert Bifet (Telecom ParisTech)
Jiajin Zhang (HUAWEI)
Wei Fan (Baidu Research Big Data Lab)
Cheng He (HUAWEI)
Jianfeng Zhang (HUAWEI)
Jianfeng Qian (Columbia University)
Geoff Holmes (University of Waikato)
Bernhard Pfahringer (University of Waikato) |
| |
Real-Time
Optimization of Web Publisher RTB Revenues (Page
1743)
Pedro Chahuara (XRCE)
Nicolas Grislain (AlephD)
Gregoire Jauvion (AlephD)
Jean-Michel Renders (XRCE)
Customer
Lifetime Value Prediction Using Embeddings (Page
1753)
Benjamin Paul Chamberlain (Imperial College London)
Ângelo Cardoso (ASOS.com)
C.H. Bryan Liu (ASOS.com)
Roberto Pagliari (ASOS.com)
Marc Peter Deisenroth (Imperial College London)
TensorFlow
Estimators: Managing Simplicity vs. Flexibility in High-Level Machine
Learning Frameworks (Page
1763)
Heng-Tze Cheng (Google, Inc.)
Zakaria Haque (Google, Inc.)
Lichan Hong (Google, Inc.)
Mustafa Ispir (Google, Inc.)
Clemens Mewald (Google, Inc)
Illia Polosukhin (Google, Inc.)
Georgios Roumpos (Google, Inc.)
D Sculley (Google, Inc.)
Jamie Smith (Google, Inc.)
David Soergel (Google, Inc.)
Yuan Tang (Uptake Technologies, Inc.)
Philipp Tucker (Google, Inc.)
Martin Wicke (Google, Inc.)
Cassandra Xia (Google, Inc.)
Jianwei Xie (Google, Inc.) |
| |
Learning
Tree-Structured Detection Cascades for Heterogeneous Networks of Embedded
Devices (Page 1773)
Hamid Dadkhahi (University of Massachusetts, Amherst)
Benjamin M. Marlin (University of Massachusetts, Amherst)
AESOP:
Automatic Policy Learning for Predicting and Mitigating Network Service
Impairments (Page
1783)
Supratim Deb (AT&T Labs)
Zihui Ge (AT&T Labs)
Sastry Isukapalli (AT&T Labs)
Sarat Puthenpura (AT&T Labs)
Shobha Venkataraman (AT&T Labs)
He Yan (AT&T Labs)
Jennifer Yates (AT&T Labs)
Automated
Categorization of Onion Sites for Analyzing the Darkweb Ecosystem (Page
1793)
Shalini Ghosh (SRI International)
Ariyam Das (University of California, Los Angeles)
Phil Porras (SRI International)
Vinod Yegneswaran (SRI International)
Ashish Gehani (SRI International) |
| |
Toward
Automated Fact-Checking: Detecting Check-worthy Factual Claims by ClaimBuster (Page
1803)
Naeemul Hassan (University of Mississippi)
Fatma Arslan (University of Texas at Arlington)
Chengkai Li (University of Texas at Arlington)
Mark Tremayne (University of Texas at Arlington)
An
Efficient Bandit Algorithm for Realtime Multivariate Optimization (Page
1813)
Daniel N. Hill (Amazon.com, Inc.)
Houssam Nassif (Amazon.com, Inc.)
Yi Liu (Amazon.com, Inc.)
Anand Iyer (Amazon.com, Inc.)
S.V.N. Vishwanathan (Amazon.com, Inc. & University of California, Santa
Cruz)
Large
Scale Sentiment Learning with Limited Labels (Page
1823)
Vasileios Iosifidis (Leibniz University Hanover & L3S Research Center)
Eirini Ntoutsi (Leibniz University Hanover & L3S Research Center)
Optimization
Beyond Prediction:Prescriptive Price Optimization (Page
1833)
Shinji Ito (NEC Corporation)
Ryohei Fujimaki (NEC Corpoartion) |
| |
Finding
Precursors to Anomalous Drop in Airspeed During a Flight's Takeoff (Page
1843)
Vijay Manikandan Janakiraman (USRA/ NASA Ames Research Center)
Bryan Matthews (USRA/ NASA Ames Research Center)
Nikunj Oza (NASA Ames Research Center)
Ad
Serving with Multiple KPIs (Page
1853)
Brendan Kitts (PrecisionDemand)
Michael Krishnan (Adap.tv)
Ishadutta Yadav (Adap.tv)
Yongbo Zeng (Adap.tv)
Garrett Badeau (Adap.tv)
Andrew Potter (AOL Platforms)
Sergey Tolkachov (AOL Platforms)
Ethan Thornburg (AOL Platforms)
Satyanarayana Reddy Janga (AOL Platforms)
Discovering
Pollution Sources and Propagation Patterns in Urban Area (Page
1863)
Xiucheng Li (Nanyang Technological University)
Yun Cheng (Air Scientific)
Gao Cong (Nanyang Technological University)
Lisi Chen (Hong Kong Baptist University)
Discovering
Enterprise Concepts Using Spreadsheet Tables (Page
1873)
Keqian Li (University of California, Santa Barbara & Microsoft Research)
Yeye He (Microsoft Research)
Kris Ganjam (Microsoft Research) |
| |
Supporting
Employer Name Normalization at both Entity and Cluster Level (Page
1883)
Qiaoling Liu (CareerBuilder LLC)
Faizan Javed (CareerBuilder LLC)
Vachik S. Dave (Indiana University - Purdue University Indianapolis
& CareerBuilder LLC)
Ankita Joshi (University of Georgia & CareerBuilder LLC)
BDT:
Gradient Boosted Decision Tables for High Accuracy and Scoring Efficiency (Page
1893)
Yin Lou (Airbnb Incorporation)
Mikhail Obukhov (LinkedIn Corporation)
Dipole:
Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent
Neural Networks (Page
1903)
Fenglong Ma (SUNY Buffalo & Xerox)
Radha Chitta (Conduent Labs US)
Jing Zhou (Conduent Labs US)
Quanzeng You (University of Rochester)
Tong Sun (United Technologies Research Center & Xerox)
Jing Gao (SUNY Buffalo)
Internet
Device Graphs (Page
1913)
Matthew Malloy (comScore)
Paul Barford (comScore & University of Wisconsin)
Enis Ceyhun Alp (comScore & University of Wisconsin)
Jonathan Koller (comScore)
Adria Jewell (comScore) |
| |
RUSH!
Targeted Time-limited Coupons via Purchase Forecasts (Page
1923)
Emaad Manzoor (Carnegie Mellon University)
Leman Akoglu (Carnegie Mellon University)
Embedding-based
News Recommendation for Millions of Users (Page
1933)
Shumpei Okura (Yahoo Japan Corporation)
Yukihiro Tagami (Yahoo Japan Corporation)
Shingo Ono (Yahoo Japan Corporation)
Akira Tajima (Yahoo Japan Corporation)
Learning
to Count Mosquitoes for the Sterile Insect Technique (Page
1943)
Yaniv Ovadia (Google Inc.)
Yoni Halpern (Google Inc.)
Dilip Krishnan (Google Inc.)
Josh Livni (Verily Inc.)
Daniel Newburger (Verily Inc.)
Ryan Poplin (Verily Inc.)
Tiantian Zha (Verily Inc.)
D. Sculley (Google Inc.) |
| |
An
Intelligent Customer Care Assistant System for Large-Scale Cellular Network
Diagnosis (Page 1951)
Lujia Pan (Huawei Technologies)
Jianfeng Zhang (Huawei Technologies)
Patrick P. C. Lee (Chinese University of Hong Kong)
Hong Cheng (Chinese University of Hong Kong)
Cheng He (Huawei Technologies)
Caifeng He (Huawei Technologies)
Keli Zhang (Huawei Technologies)
Deep
Design: Product Aesthetics for Heterogeneous Markets (Page
1961)
Yanxin Pan (University of Michigan)
Alexander Burnap (University of Michigan)
Jeffrey Hartley (General Motors)
Richard Gonzalez (University of Michigan)
Panos Y. Papalambros (University of Michigan)
Collecting
and Analyzing Millions of mHealth Data Streams (Page
1971)
Tom Quisel (Evidation Health, Inc.)
Luca Foschini (Evidation Health, Inc.)
Alessio Signorini (Evidation Health, Inc.)
David C. Kale (University of Southern California) |
| |
Dispatch
with Confidence: Integration of Machine Learning, Optimization and Simulation
for Open Pit Mines (Page
1981)
Kosta Ristovski (Hitachi America Ltd)
Chetan Gupta (Hitachi America Ltd)
Kunihiko Harada (Hitachi America Ltd)
Hsiu-Khuern Tang (Hitachi America Ltd)
"The
Leicester City Fairytale?": Utilizing New Soccer Analytics Tools to Compare
Performance in the 15/16 & 16/17 EPL Seasons (Page
1991)
Hector Ruiz (STATS)
Paul Power (STATS)
Xinyu Wei (STATS)
Patrick Lucey (STATS)
Matching
Restaurant Menus to Crowdsourced Food Data: A Scalable Machine Learning
Approach (Page 2001)
Hesam Salehian (Under Armour Connected Fitnes)
Patrick Howell (Under Armour Connected Fitnes)
Chul Lee (Under Armour Connected Fitnes)
The
Fake vs Real Goods Problem: Microscopy and Machine Learning to the Rescue (Page
2011)
Ashlesh Sharma (Entrupy Inc)
Vidyuth Srinivasan (Entrupy Inc)
Vishal Kanchan (Entrupy Inc)
Lakshminarayanan Subramanian (Entrupy Inc and New York University) |
| |
Automatic
Application Identification from Billions of Files (Page
2021)
Kyle Soska (Carnegie Mellon University)
Chris Gates (Symantec)
Kevin A. Roundy (Symantec)
Nicolas Christin (Carnegie Mellon University)
Learning
to Generate Rock Descriptions from Multivariate Well Logs with Hierarchical
Attention (Page 2031)
Bin Tong (Hitachi, Ltd.)
Martin Klinkigt (Hitachi, Ltd.)
Makoto Iwayama (Hitachi, Ltd.)
Toshihiko Yanase (Hitachi, Ltd.)
Yoshiyuki Kobayashi (Hitachi, Ltd.)
Anshuman Sahu (Hitachi America, Ltd.)
Ravigopal Vennelakanti (Hitachi America, Ltd.)
Multi-view
Learning over Retinal Thickness and Visual Sensitivity on Glaucomatous
Eyes (Page 2041)
Toshimitsu Uesaka (The University of Tokyo)
Kai Morino (The University of Tokyo)
Hiroki Sugiura (The University of Tokyo)
Taichi Kiwaki (The University of Tokyo)
Hiroshi Murata (The University of Tokyo)
Ryo Asaoka (The University of Tokyo)
Kenji Yamanishi (The University of Tokyo) |
| |
Dynamic
Attention Deep Model for Article Recommendation by Learning Human Editors'
Demonstration (Page
2051)
Xuejian Wang (Shanghai Jiao Tong University)
Lantao Yu (Shanghai Jiao Tong University)
Kan Ren (Shanghai Jiao Tong University)
Guanyu Tao (ULU Technologies Inc.)
Weinan Zhang (Shanghai Jiao Tong University)
Yong Yu (Shanghai Jiao Tong University)
Jun Wang (University College London)
A
Hybrid Framework for Text Modeling with Convolutional RNN (Page
2061)
Chenglong Wang (Alibaba Group)
Feijun Jiang (Alibaba Group)
Hongxia Yang (Alibaba Group)
Formative
Essay Feedback Using Predictive Scoring Models (Page
2071)
Bronwyn Woods (Turnitin)
David Adamson (Turnitin)
Shayne Miel (Turnitin)
Elijah Mayfield (Turnitin)
Learning
Temporal State of Diabetes Patients via Combining Behavioral and Demographic
Data (Page 2081)
Houping Xiao (SUNY Buffalo & T.J. Watson Research Center)
Jing Gao (SUNY Buffalo)
Long Vu (IBM T.J. Watson Research Center)
Deepak S. Turaga (IBM T.J. Watson Research Center) |
| |
Local
Algorithm for User Action Prediction Towards Display Ads (Page
2091)
Hongxia Yang (Alibaba Group)
Yada Zhu (IBM Research)
Jingrui He (Arizona State University)
Visual
Search at eBay (Page
2101)
Fan Yang (eBay Inc.)
Ajinkya Kale (eBay Inc.)
Yury Bubnov (eBay Inc.)
Leon Stein (eBay Inc.)
Qiaosong Wang (eBay Inc.)
Hadi Kiapour (eBay Inc.)
Robinson Piramuthu (eBay Inc.)
A
Data-driven Process Recommender Framework (Page
2111)
Sen Yang (Rutgers University)
Xin Dong (Rutgers University)
Leilei Sun (Tsinghua University)
Yichen Zhou (Rutgers University)
Richard A. Farneth (Children's National Medical Center)
Hui Xiong (Rutgers University)
Randall S. Burd (Children's National Medical Center)
Ivan Marsic (Rutgers University) |
| |
Predicting
Optimal Facility Location without Customer Locations (Page
2121)
Emre Yilmaz (Bilkent University)
Sanem Elbasi (Bilkent University)
Hakan Ferhatosmanoglu (Bilkent University)
DeepProbe:
Information Directed Sequence Understanding and Chatbot Design via Recurrent
Neural Networks (Page
2131)
Zi Yin (Stanford University & Microsoft)
Keng-hao Chang (Microsoft)
Ruofei Zhang (Microsoft)
Stock
Price Prediction via Discovering Multi-Frequency Trading Patterns (Page
2141)
Liheng Zhang (University of Central Florida)
Charu Aggarwal (IBM T. J. Watson Research Center)
Guo-Jun Qi (University of Central Florida)
A
Taxi Order Dispatch Model based On Combinatorial Optimization (Page
2151)
Lingyu Zhang (Didi Research Institute, Didi Chuxing)
Tao Hu (Didi Research Institute, Didi Chuxing)
Yue Min (Didi Research Institute, Didi Chuxing)
Guobin Wu (Didi Research Institute, Didi Chuxing)
Junying Zhang (Didi Research Institute, Didi Chuxing)
Pengcheng Feng (Didi Research Institute, Didi Chuxing)
Pinghua Gong (Didi Research Institute, Didi Chuxing)
Jieping Ye (Didi Research Institute, Didi Chuxing) |
| |
Contextual
Spatial Outlier Detection with Metric Learning (Page
2161)
Guanjie Zheng (Pennsylvania State University)
Susan L. Brantley (Pennsylvania State University)
Thomas Lauvaux (Pennsylvania State University)
Zhenhui Li (Pennsylvania State University)
Resolving
the Bias in Electronic Medical Records (Page
2171)
Kaiping Zheng (National University of Singapore)
Jinyang Gao (National University of Singapore)
Kee Yuan Ngiam (National University Health System)
Beng Chin Ooi (National University of Singapore)
Wei Luen James Yip (National University Health System)
STAR:
A System for Ticket Analysis and Resolution (Page
2181)
Wubai Zhou (Florida International University)
Wei Xue (Florida International University)
Ramesh Baral (Florida International University)
Qing Wang (Florida International University)
Chunqiu Zeng (Florida International University)
Tao Li (Florida International University)
Jian Xu (Nanjing University of Science and Technology)
Zheng Liu (Nanjing University of Posts and Telecommunications)
Larisa Shwartz (IBM T.J. Watson Research Center)
Genady Ya. Grabarnik (St. John's University, Queens)
Optimized
Cost per Click in Taobao Display Advertising (Page
2191)
Han Zhu (Alibaba Group)
Junqi Jin (Alibaba Group)
Chang Tan (Alibaba Group)
Fei Pan (Alibaba Group)
Yifan Zeng (Alibaba Group)
Han Li (Alibaba Group)
Kun Gai (Alibaba Group) |
| |
|