Main Page

Table of Contents

Author Index

Author Index


(Return to Top)

Abdulhayoglu, Melih, Comodo Security Solutions, Inc.

HinDroid: An Intelligent Android Malware Detection System Based on Structured Heterogeneous Information Network (Page 1507)


Abernethy, Jacob, University of Michigan

A Data Science Approach to Understanding Residential Water Contamination in Flint (Page 1407)


(Return to Top)

Acuna-Agost, Rodrigo, Amadeus SAS

Deep Choice Model Using Pointer Networks for Airline Itinerary Prediction (Page 1575)


Adamson, David, Turnitin

Formative Essay Feedback Using Predictive Scoring Models (Page 2071)


Afroz, Sadia, ICSI

Backpage and Bitcoin: Uncovering Human Traffickers (Page 1595)


Agarwal, Aman, Cornell University

Effective Evaluation Using Logged Bandit Feedback from Multiple Loggers (Page 687)


Agarwal, Deepak, LinkedIn

The Future of Artificially Intelligent Assistants (Page 33)


(Return to Top)

Aggarwal, Charu, IBM T. J. Watson Research Center

Human Mobility Synchronization and Trip Purpose Detection with Mixture of Hawkes Processes (Page 495)

Randomized Feature Engineering as a Fast and Accurate Alternative to Kernel Methods (Page 485)

REMIX: Automated Exploration for Interactive Outlier Detection (Page 827)

Similarity Forests (Page 395)

Stock Price Prediction via Discovering Multi-Frequency Trading Patterns (Page 2141)

Unsupervised P2P Rental Recommendations via Integer Programming (Page 165)


Agrawal, Saurabh, University of Minnesota

Tripoles: A New Class of Relationships in Time Series Data (Page 697)


Ahmed, Amr, Google Research

A Practical Algorithm for Solving the Incoherence Problem of Topic Models In Industrial Applications (Page 1713)


Airoldi, Edoardo M., Harvard University

Detecting Network Effects: Randomizing Over Randomized Experiments (Page 1027)


(Return to Top)

Ajilore, Olu, University of Illinoisat Chicago

DeepMood: Modeling Mobile Phone Typing Dynamics for Mood Detection (Page 747)


Akoglu, Leman, Carnegie Mellon University

RUSH! Targeted Time-limited Coupons via Purchase Forecasts (Page 1923)


Alabi, Daniel, Harvard University

Learning Certifiably Optimal Rule Lists (Page 35)


Albert, Adrian, Massachusetts Institute of Technology

Using Convolutional Networks and Satellite Imagery to Identify Patterns in Urban Environments at a Large Scale (Page 1357)


Alcorn, Scott, Early Warnings LLC.

A Local Algorithm for Structure-Preserving Graph Cut (Page 655)


Alizadeh, Keivan, Sharif University of Technology

Recurrent Poisson Factorization for Temporal Recommendation (Page 847)


(Return to Top)

Alle, Sachidanand, Yahoo

A Practical Exploration System for Search Advertising (Page 1625)


Alp, Enis Ceyhun, comScore & University of Wisconsin

Internet Device Graphs (Page 1913)


Anchuri, Pranay, NEC Laboratories America

Structural Event Detection from Log Messages (Page 1175)


Anderson, Blake, Cisco Systems, Inc.

Machine Learning for Encrypted Malware Traffic Classification: Accounting for Noisy Labels and Non-Stationarity (Page 1723)


Ang, Lynn, University of Michigan

Contextual Motifs: Increasing the Utility of Motifs using Contextual Data (Page 155)


Angelino, Elaine, University of California, Berkeley

Learning Certifiably Optimal Rule Lists (Page 35)


(Return to Top)

Antikacioglu, Arda, Carnegie Mellon University

Post Processing Recommender Systems for Diversity (Page 707)


Aoki, Raquel Y S, Universidade Federal de Minas Gerais

Luck is Hard to Beat: The Difficulty of Sports Prediction (Page 1367)


Arabzadeh, Ali, Sharif University of Technology

Recurrent Poisson Factorization for Temporal Recommendation (Page 847)


Arimura, Hiroki, Hokkaido University

Statistical Emerging Pattern Mining with Multiple Testing Correction (Page 897)


Ariño de la Rubia, Eduardo, Domino Data Lab

Benchmarks and Process Management in Data Science: Will We Ever Get Over the Mess? (Page 31)

More than the Sum of its Parts: Building Domino Data Lab (Page 9)


Arslan, Fatma, University of Texas at Arlington

Toward Automated Fact-Checking: Detecting Check-worthy Factual Claims by ClaimBuster (Page 1803)


(Return to Top)

Asaoka, Ryo, The University of Tokyo

Multi-view Learning over Retinal Thickness and Visual Sensitivity on Glaucomatous Eyes (Page 2041)


Assuncao, Renato M., Universidade Federal de Minas Gerais

Luck is Hard to Beat: The Difficulty of Sports Prediction (Page 1367)


Atluri, Gowtham, University of Cincinnati

Tripoles: A New Class of Relationships in Time Series Data (Page 697)


Avin, Chen, Ben Gurion University of the Negev

Improved Degree Bounds and Full Spectrum Power Laws in Preferential Attachment Networks (Page 45)


Badeau, Garrett, Adap.tv

Ad Serving with Multiple KPIs (Page 1853)


Bahadori, Mohammad Taha, Georgia Institute of Technology

GRAM: Graph-based Attention Model for Healthcare Representation Learning (Page 787)


(Return to Top)

Bai, Lanxiao, University of Illinois, Urbana Champaign

Bridging Collaborative Filtering and Semi-Supervised Learning: A Neural Approach for POI Recommendation (Page 1245)


Bai, Zilong, University of California, Davis

Unsupervised Network Discovery for Brain Imaging Data (Page 55)


Bao, Dewei, Huawei Nanjing Research and Development Center

FLAP: An End-to-End Event Log Analysis Platform for System Management (Page 1547)


Bao, Jie, Microsoft Research

Planning Bike Lanes based on Sharing-Bikes' Trajectories (Page 1377)


Baral, Ramesh, Florida International University

STAR: A System for Ticket Analysis and Resolution (Page 2181)


Barford, Paul, comScore & University of Wisconsin

Internet Device Graphs (Page 1913)


(Return to Top)

Basu, Soumya, Cornell University

Effective Evaluation Using Logged Bandit Feedback from Multiple Loggers (Page 687)


Bauman, Konstantin, New York University

Aspect Based Recommendations: Recommending Items with the Most Valuable Aspects Based on User Reviews (Page 717)


Baylor, Denis, Google Inc.

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


Baytas, Inci M., Michigan State University

Patient Subtyping via Time-Aware LSTM Networks (Page 65)

Privacy-Preserving Distributed Multi-Task Learning with Asynchronous Updates (Page 1195)


Benson, Austin R., Stanford University

Local Higher-Order Graph Clustering (Page 555)


Berg-Kirkpatrick, Taylor, Carnegie Mellon University

Efficient Correlated Topic Modeling with Topic Embedding (Page 225)


(Return to Top)

Berglund, Andy, University of Florida

Mining Big Data in NeuroGenetics to Understand Muscular Dystrophy (Page 11)


Bhagat, Smriti, Facebook & Technicolor

PNP: Fast Path Ensemble Method for Movie Design (Page 1527)


Bifet, Albert, Telecom ParisTech

Extremely Fast Decision Tree Mining for Evolving Data Streams (Page 1733)


Blalock, Davis W., Massachusetts Institute of Technology

Bolt: Accelerated Data Mining with Fast Vector Compression (Page 727)


Bloom, Josh, GE

Industrial Machine Learning (Page 13)


Böhm, Christian, Ludwig-Maximilians-Universität München

Learning from Labeled and Unlabeled Vertices in Networks (Page 1265)

Towards an Optimal Subspace for K-Means (Page 365)


(Return to Top)

Bojchevski, Aleksandar, Technical University of Munich

Robust Spectral Clustering for Noisy Data: Modeling Sparse Corruptions Improves Latent Embeddings (Page 737)


Boley, Mario, Max Planck Institute for Informatics & Saarland University

Discovering Reliable Approximate Functional Dependencies (Page 355)


Bolot, Jean, Technicolor

PNP: Fast Path Ensemble Method for Movie Design (Page 1527)


Borisyuk, Fedor, LinkedIn Corporation

LiJAR: A System for Job Application Redistribution towards Efficient Career Marketplace (Page 1397)


Boyd, Stephen, Stanford University

Network Inference via the Time-Varying Graphical Lasso (Page 205)

Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data (Page 215)


Brantley, Susan L., Pennsylvania State University

Contextual Spatial Outlier Detection with Metric Learning (Page 2161)


(Return to Top)

Breck, Eric, Google Inc.

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


Brownstein, John S., Boston Children's Hospital

GELL: Automatic Extraction of Epidemiological Line Lists from Open Sources (Page 1477)


Brusilovsky, Peter, University of Pittsburgh

Semi-Supervised Techniques for Mining Learning Outcomes and Prerequisites (Page 907)


Bubnov, Yury, eBay Inc.

Visual Search at eBay (Page 2101)


Buchler, Norbou, US Army Research Laboratory

Is the Whole Greater Than the Sum of Its Parts? (Page 295)


Burd, Randall S., Children's National Medical Center

A Data-driven Process Recommender Framework (Page 2111)


(Return to Top)

Burnap, Alexander, University of Michigan

Deep Design: Product Aesthetics for Heterogeneous Markets (Page 1961)


Butts, Carter T., University of California, Irvine

Construction of Directed 2K Graphs (Page 1115)


Candel, Arno, H2O.ai, Inc.

Benchmarks and Process Management in Data Science: Will We Ever Get Over the Mess? (Page 31)


Cao, Bokai, University of Illinois at Chicago

DeepMood: Modeling Mobile Phone Typing Dynamics for Mood Detection (Page 747)

Structural Deep Brain Network Mining (Page 475)


Cao, Lei, Massachusetts Institute of Technology

Distributed Local Outlier Detection in Big Data (Page 1225)

MARAS: Signaling Multi-Drug Adverse Reactions (Page 1615)

Scalable Top-n Local Outlier Detection (Page 1235)


Cao, Longbing, University of Technology Sydney

Behavior Informatics to Discover Behavior Insight for Active and Tailored Client Management (Page 15)

Discrete Content-aware Matrix Factorization (Page 325)


(Return to Top)

Cao, Nan, Tongji University

FIRST: Fast Interactive Attributed Subgraph Matching (Page 1447)

Is the Whole Greater Than the Sum of Its Parts? (Page 295)


Cardoso, Ângelo, ASOS.com

Customer Lifetime Value Prediction Using Embeddings (Page 1753)


Carin, Lawrence, Duke University

Evaluating U.S. Electoral Representation with a Joint Statistical Model of Congressional Roll-Calls, Legislative Text, and Voter Registration Data (Page 1205)


Carlson, Kimberly, University of Hawaii Manoa

Incremental Dual-memory LSTM in Land Cover Prediction (Page 867)


Cassidy, Taylor, Army Research Laboratory

MetaPAD: Meta Pattern Discovery from Massive Text Corpora (Page 877)


Caverlee, James, Texas A&M University

Multi-Aspect Streaming Tensor Completion (Page 435)


(Return to Top)

Chahuara, Pedro, XRCE

Real-Time Optimization of Web Publisher RTB Revenues (Page 1743)


Chakrabarti, Soumen, IIT Bombay

Relay-Linking Models for Prominence and Obsolescence in Evolving Networks (Page 1077)


Chakraborty, Prithwish, Virginia Tech

GELL: Automatic Extraction of Epidemiological Line Lists from Open Sources (Page 1477)


Chamberlain, Benjamin Paul, Imperial College London

Customer Lifetime Value Prediction Using Embeddings (Page 1753)


Chan, Joel, Carnegie Mellon University

Accelerating Innovation Through Analogy Mining (Page 235)


Chan, Po-Wei, University of Illinois at Urbana-Champaign

PReP: Path-Based Relevance from a Probabilistic Perspective in Heterogeneous Information Networks (Page 425)


(Return to Top)

Chandra, Rohit, Yahoo

A Practical Exploration System for Search Advertising (Page 1625)


Chang, Keng-hao, Microsoft

DeepProbe: Information Directed Sequence Understanding and Chatbot Design via Recurrent Neural Networks (Page 2131)


Chang, Xiaojun, Carnegie Mellon University

Robust Top-k Multiclass SVM for Visual Category Recognition (Page 75)


Chang, Yi, Huawei Research America

Convex Factorization Machine for Toxicogenomics Prediction (Page 1215)


Chatterjee, Snigdhansu, University of Minnesota

Tripoles: A New Class of Relationships in Time Series Data (Page 697)


Chawla, Nitesh V., University of Notre Dame

metapath2vec: Scalable Representation Learning for Heterogeneous Networks (Page 135)

Structural Diversity and Homophily: A Study Across More Than One Hundred Big Networks (Page 807)


(Return to Top)

Chen, Chao, Beihang University

No Longer Sleeping with a Bomb: A Duet System for Protecting Urban Safety from Dangerous Goods (Page 1673)


Chen, Chaochao, Ant Financial Services Group

KunPeng: Parameter Server based Distributed Learning Systems and Its Applications in Alibaba and Ant Financial (Page 1693)


Chen, Enhong, University of Science and Technology of China

Tracking the Dynamics in Crowdfunding (Page 625)


Chen, Jianhui, Microsoft

Convex Factorization Machine for Toxicogenomics Prediction (Page 1215)


Chen, Jinghui, University of Virginia

Fast Newton Hard Thresholding Pursuit for Sparsity Constrained Nonconvex Optimization (Page 757)


Chen, Lei, Hong Kong University of Science and Technology

The Simpler The Better: A Unified Approach to Predicting Original Taxi Demands based on Large-Scale Online Platforms (Page 1653)


(Return to Top)

Chen, Lisi, Hong Kong Baptist University

Discovering Pollution Sources and Propagation Patterns in Urban Area (Page 1863)


Chen, Ming-Syan, National Taiwan University

On Finding Socially Tenuous Groups for Online Social Networks (Page 415)


Chen, Robert, Georgia Institute of Technology

LEAP: Learning to Prescribe Effective and Safe Treatment Combinations for Multimorbidity (Page 1315)


Chen, Ting, University of California, Los Angeles

On Sampling Strategies for Neural Network-based Collaborative Filtering (Page 767)


Chen, Weizhu, Microsoft Research

ReasoNet: Learning to Stop Reading in Machine Comprehension (Page 1047)


Chen, Xu, Alibaba Cloud

KunPeng: Parameter Server based Distributed Learning Systems and Its Applications in Alibaba and Ant Financial (Page 1693)


(Return to Top)

Chen, Yixin, Washington University in St. Louis

Weisfeiler-Lehman Neural Machine for Link Prediction (Page 575)


Chen, Yu, Rensselaer Polytechnic Institute

KATE: K-Competitive Autoencoder for Text (Page 85)


Chen, Yuqiang, 4Paradigm Inc.

The Simpler The Better: A Unified Approach to Predicting Original Taxi Demands based on Large-Scale Online Platforms (Page 1653)


Cheng, Heng-Tze, Google, Inc.

TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks (Page 1763)

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


Cheng, Hong, Chinese University of Hong Kong

An Intelligent Customer Care Assistant System for Large-Scale Cellular Network Diagnosis (Page 1951)


Cheng, Kewei, Arizona State University

Unsupervised Feature Selection in Signed Social Networks (Page 777)


(Return to Top)

Cheng, Yun, Air Scientific

Discovering Pollution Sources and Propagation Patterns in Urban Area (Page 1863)


Chevalier, Troy, Yahoo Research

Online Ranking with Constraints: A Primal-Dual Algorithm and Applications to Web Traffic-Shaping (Page 405)


Chitta, Radha, Conduent Labs US

Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks (Page 1903)


Choi, Edward, Georgia Institute of Technology

GRAM: Graph-based Attention Model for Healthcare Representation Learning (Page 787)


Chojnacki, Alex, University of Michigan

A Data Science Approach to Understanding Residential Water Contamination in Flint (Page 1407)


Chong, Anthony, IKASI

Benchmarks and Process Management in Data Science: Will We Ever Get Over the Mess? (Page 31)


(Return to Top)

Christin, Nicolas, Carnegie Mellon University

Automatic Application Identification from Billions of Files (Page 2021)


Cohen, Edith, Google Research

HyperLogLog Hyperextended: Sketches for Concave Sublinear Frequency Statistics (Page 105)


Cohen, Reuven, Technion

A Minimal Variance Estimator for the Cardinality of Big Data Set Intersection (Page 95)


Cohn, Emily, Boston Children's Hospital

GELL: Automatic Extraction of Epidemiological Line Lists from Open Sources (Page 1477)


Colagrosso, Mike, Google USA

Quick Access: Building a Smart Experience for Google Drive (Page 1643)


Cong, Gao, Nanyang Technological University

Discovering Pollution Sources and Propagation Patterns in Urban Area (Page 1863)


(Return to Top)

Conte, Alessio, University of Pisa

Fast Enumeration of Large k-Plexes (Page 115)


Corbett-Davies, Sam, Stanford University

Algorithmic Decision Making and the Cost of Fairness (Page 797)


Cui, Peng, Tsinghua University

A Temporally Heterogeneous Survival Framework with Application to Social Behavior Dynamics (Page 1295)

Estimating Treatment Effect in the Wild via Differentiated Confounder Balancing (Page 265)

Long Short Memory Process: Modeling Growth Dynamics of Microscopic Social Connectivity (Page 565)


Cui, Qing, Alibaba Cloud

KunPeng: Parameter Server based Distributed Learning Systems and Its Applications in Alibaba and Ant Financial (Page 1693)


Curtis, Ross E., AncestryDNA

Estimation of Recent Ancestral Origins of Individuals on a Large Scale (Page 1417)


Dadkhahi, Hamid, University of Massachusetts, Amherst

Learning Tree-Structured Detection Cascades for Heterogeneous Networks of Embedded Devices (Page 1773)


Dai, Chengyu, University of Michigan

A Data Science Approach to Understanding Residential Water Contamination in Flint (Page 1407)


Dai, Wei, Carnegie Mellon University & Petuum Inc.

PPDsparse: A Parallel Primal-Dual Sparse Method for Extreme Classification (Page 545)


Das, Ariyam, University of California, Los Angeles

Automated Categorization of Onion Sites for Analyzing the Darkweb Ecosystem (Page 1793)


Dau, Hoang Anh, University of California Riverside

Matrix Profile V: A Generic Technique to Incorporate Domain Knowledge into Motif Discovery (Page 125)


Dave, Vachik S., Indiana University - Purdue University Indianapolis & CareerBuilder LLC

Supporting Employer Name Normalization at both Entity and Cluster Level (Page 1883)


(Return to Top)

Davidson, Ian, University of California, Davis

Unsupervised Network Discovery for Brain Imaging Data (Page 55)


Davulcu, Hasan, Arizona State University

A Local Algorithm for Structure-Preserving Graph Cut (Page 655)


De La Vega, Alejandro, University of Texas at Austin

Developing a Comprehensive Framework for Multimodal Feature Extraction (Page 1567)


Deb, Supratim, AT&T Labs

AESOP: Automatic Policy Learning for Predicting and Mitigating Network Service Impairments (Page 1783)


Deisenroth, Marc Peter, Imperial College London

Customer Lifetime Value Prediction Using Embeddings (Page 1753)


Desai, Paritosh, Target

It Takes More than Math and Engineering to Hit the Bullseye with Data (Page 17)


(Return to Top)

Dhillon, Inderjit S., University of Texas at Austin

Communication-Efficient Distributed Block Minimization for Nonlinear Kernel Machines (Page 245)

PPDsparse: A Parallel Primal-Dual Sparse Method for Extreme Classification (Page 545)


Dighe, Abhilash, University of Michigan

PNP: Fast Path Ensemble Method for Movie Design (Page 1527)


Ding, Yi, Alibaba Cloud

KunPeng: Parameter Server based Distributed Learning Systems and Its Applications in Alibaba and Ant Financial (Page 1693)


Dmitriev, Pavel, Microsoft Corporation

A Dirty Dozen: Twelve Common Metric Interpretation Pitfalls in Online Controlled Experiments (Page 1427)


Doerfler, Periwinkle, New York University

Backpage and Bitcoin: Uncovering Human Traffickers (Page 1595)


Dong, Xin, Rutgers University

A Data-driven Process Recommender Framework (Page 2111)


(Return to Top)

Dong, Yuxiao, Microsoft Research

A Century of Science: Globalization of Scientific Collaborations, Citations, and Innovations (Page 1437)

metapath2vec: Scalable Representation Learning for Heterogeneous Networks (Page 135)

Structural Diversity and Homophily: A Study Across More Than One Hundred Big Networks (Page 807)


Du, Boxin, Arizona State University

FIRST: Fast Interactive Attributed Subgraph Matching (Page 1447)


Du, Changying, Chinese Academy of Sciences

A Location-Sentiment-Aware Recommender System for Both Home-Town and Out-of-Town Users (Page 1135)


Duan, Weitao, LinkedIn

Detecting Network Effects: Randomizing Over Randomized Experiments (Page 1027)


Dwork, Cynthia, Microsoft Research & Harvard University

What's Fair? (Page 1)

 

(Return to Top to Navigate the KDD'17 Author Index)