Author Index


Eikmeier, Nicole, Purdue University

Revisiting Power-law Distributions in Spectra of Real World Networks (Page 817)


Elbasi, Sanem, Bilkent University

Predicting Optimal Facility Location without Customer Locations (Page 2121)


Emrani, Saba, SAS Institute Inc.

Prognosis and Diagnosis of Parkinson's Disease Using Multi-Task Learning (Page 1457)


Epasto, Alessandro, Google Research

Ego-Splitting Framework: from Non-Overlapping to Overlapping Clusters (Page 145)


Faloutsos, Christos, Carnegie Mellon University

DenseAlert: Incremental Dense-Subtensor Detection in Tensor Streams (Page 1057)

Long Short Memory Process: Modeling Growth Dynamics of Microscopic Social Connectivity (Page 565)

PNP: Fast Path Ensemble Method for Movie Design (Page 1527)


Fan, Wei, Baidu Research Big Data Lab

Extremely Fast Decision Tree Mining for Evolving Data Streams (Page 1733)


Farahi, Arya, University of Michigan

A Data Science Approach to Understanding Residential Water Contamination in Flint (Page 1407)


Farajtabar, Mehrdad, Georgia Institute of Technology

Recurrent Poisson Factorization for Temporal Recommendation (Page 847)


Farneth, Richard A., Children's National Medical Center

A Data-driven Process Recommender Framework (Page 2111)


Farooq, Faisal, IBM

Chairs' Welcome Message


Fayyad, Usama M., Open Insights

Benchmarks and Process Management in Data Science: Will We Ever Get Over the Mess? (Page 31)

Foreword to the Applied Data Science - Invited Talks Track at KDD-2017 (Page 7)


Feller, Avi, University of California, Berkeley

Algorithmic Decision Making and the Cost of Fairness (Page 797)


Feng, Pengcheng, Didi Research Institute, Didi Chuxing

A Taxi Order Dispatch Model based On Combinatorial Optimization (Page 2151)


Ferhatosmanoglu, Hakan, Bilkent University

Predicting Optimal Facility Location without Customer Locations (Page 2121)


Fiedel, Noah, Google Inc.

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


Firmani, Donatella, Roma Tre University

Fast Enumeration of Large k-Plexes (Page 115)


Foo, Chuan Yu, Google Inc.

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


Foschini, Luca, Evidation Health, Inc.

Collecting and Analyzing Millions of mHealth Data Streams (Page 1971)


Fouque, Pierre-Alain, IRISA

Anomaly Detection in Streams with Extreme Value Theory (Page 1067)


Fox, Ian, University of Michigan

Contextual Motifs: Increasing the Utility of Motifs using Contextual Data (Page 155)


Frost, Richie, University of Utah

Compass: Spatio Temporal Sentiment Analysis of US Election: What Twitter Says! (Page 1585)


Fu, Yanjie, Missouri University of Science and Technology

Effective and Real-time In-App Activity Analysis in Encrypted Internet Traffic Streams (Page 335)

Human Mobility Synchronization and Trip Purpose Detection with Mixture of Hawkes Processes (Page 495)

REMIX: Automated Exploration for Interactive Outlier Detection (Page 827)

Unsupervised P2P Rental Recommendations via Integer Programming (Page 165)


Fu, Yanmei, Chinese Academy of Sciences & University of Chinese Academy of Sciences

A Location-Sentiment-Aware Recommender System for Both Home-Town and Out-of-Town Users (Page 1135)


Fujimaki, Ryohei, NEC Corporation

Optimization Beyond Prediction: Prescriptive Price Optimization (Page 1833)


Gabel, Moshe, Technion - Israel Institute of Technology

Anarchists, Unite: Practical Entropy Approximation for Distributed Streams (Page 837)


Gai, Kun, Alibaba Group

Optimized Cost per Click in Taobao Display Advertising (Page 2191)


Gan, Guojun, University of Connecticut

A Data Mining Framework for Valuing Large Portfolios of Variable Annuities (Page 1467)


Ganguly, Auroop R., Northeastern University

DeepSD: Generating High Resolution Climate Change Projections through Single Image Super-Resolution (Page 1663)


Ganguly, Sangram, Bay Area Environmental Research Institute / NASA Ames Research Center

DeepSD: Generating High Resolution Climate Change Projections through Single Image Super-Resolution (Page 1663)


Ganjam, Kris, Microsoft Research

Discovering Enterprise Concepts Using Spreadsheet Tables (Page 1873)


Gao, Jianfeng, Microsoft Research

ReasoNet: Learning to Stop Reading in Machine Comprehension (Page 1047)


Gao, Jianxi, Northeastern University

The Co-Evolution Model for Social Network Evolving and Opinion Migration (Page 175)


Gao, Jing, SUNY at Buffalo

Collaboratively Improving Topic Discovery and Word Embeddings by Coordinating Global and Local Contexts (Page 535)

Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks (Page 1903)

Learning Temporal State of Diabetes Patients via Combining Behavioral and Demographic Data (Page 2081)

Unsupervised Discovery of Drug Side-Effects from Heterogeneous Data Sources (Page 967)


Gao, Jinyang, National University of Singapore

Resolving the Bias in Electronic Medical Records (Page 2171)


Garg, Divanshu, Google Australia

Quick Access: Building a Smart Experience for Google Drive (Page 1643)


Gates, Chris, Symantec

Automatic Application Identification from Billions of Files (Page 2021)


Ge, Hancheng, Texas A&M University

Multi-Aspect Streaming Tensor Completion (Page 435)


Ge, Yong, University of Arizona

A Context-aware Attention Network for Interactive Question Answering (Page 927)

Discrete Content-aware Matrix Factorization (Page 325)

Prospecting the Career Development of Talents: A Survival Analysis Perspective (Page 917)

Tracking the Dynamics in Crowdfunding (Page 625)


Ge, Zihui, AT&T Labs

AESOP: Automatic Policy Learning for Predicting and Mitigating Network Service Impairments (Page 1783)


Gehani, Ashish, SRI International

Automated Categorization of Onion Sites for Analyzing the Darkweb Ecosystem (Page 1793)


Geramifard, Alborz, Amazon

The Future of Artificially Intelligent Assistants (Page 33)


Gerber, James, University of Minnesota

Incremental Dual-memory LSTM in Land Cover Prediction (Page 867)


Ghosh, Saurav, Virginia Tech

GELL: Automatic Extraction of Epidemiological Line Lists from Open Sources (Page 1477)


Ghosh, Shalini, SRI International

Automated Categorization of Onion Sites for Analyzing the Darkweb Ecosystem (Page 1793)


Ghosh, Souvik, LinkedIn

Detecting Network Effects: Randomizing Over Randomized Experiments (Page 1027)


Giannotti, Fosca, ISTI-CNR

Clustering Individual Transactional Data for Masses of Users (Page 195)


Gibbons, Julian, Google Australia

Quick Access: Building a Smart Experience for Google Drive (Page 1643)


Giles, C. Lee, The Pennsylvania State University

Adversary Resistant Deep Neural Networks with an Application to Malware Detection (Page 1145)


Gionis, Aristides, Aalto University

Inferring the Strength of Social Ties: A Community-Driven Approach (Page 1017)


Girshick, Ahna R., AncestryDNA

Estimation of Recent Ancestral Origins of Individuals on a Large Scale (Page 1417)


Gjoka, Minas, University of California, Irvine

Construction of Directed 2K Graphs (Page 1115)


Gleich, David F., Purdue University

Local Higher-Order Graph Clustering (Page 555)

Retrospective Higher-Order Markov Processes for User Trails (Page 1185)

Revisiting Power-law Distributions in Spectra of Real World Networks (Page 817)


Goel, Sharad, Stanford University

Algorithmic Decision Making and the Cost of Fairness (Page 797)


Golovin, Daniel, Google Research

Google Vizier: A Service for Black-Box Optimization (Page 1487)


Gong, Jen J., Massachusetts Institute of Technology

Predicting Clinical Outcomes Across Changing Electronic Health Record Systems (Page 1497)


Gong, Pinghua, Didi Research Institute, Didi Chuxing

A Taxi Order Dispatch Model based On Combinatorial Optimization (Page 2151)


Gonzalez, Marta C., Massachusetts Institute of Technology

Using Convolutional Networks and Satellite Imagery to Identify Patterns in Urban Environments at a Large Scale (Page 1357)


Gonzalez, Richard, University of Michigan

Deep Design: Product Aesthetics for Heterogeneous Markets (Page 1961)


Goyal, Amit, Yahoo Research

Convex Factorization Machine for Toxicogenomics Prediction (Page 1215)


Goyal, Pawan, IIT Kharagpur

Relay-Linking Models for Prominence and Obsolescence in Evolving Networks (Page 1077)


Green, Alan, Google Australia

Quick Access: Building a Smart Experience for Google Drive (Page 1643)


Grislain, Nicolas, AlephD

Real-Time Optimization of Web Publisher RTB Revenues (Page 1743)


Gu, Bin, University of Texas at Arlington

Groups-Keeping Solution Path Algorithm for Sparse Regression with Automatic Feature Grouping (Page 185)


Gu, Quanquan, University of Virginia

Fast Newton Hard Thresholding Pursuit for Sparsity Constrained Nonconvex Optimization (Page 757)


Gu, Yupeng, University of California, Los Angeles

The Co-Evolution Model for Social Network Evolving and Opinion Migration (Page 175)


Gui, Huan, University of Illinois at Urbana-Champaign

PReP: Path-Based Relevance from a Probabilistic Perspective in Heterogeneous Information Networks (Page 425)


Guidotti, Riccardo, ISTI-CNR & University of Pisa

Clustering Individual Transactional Data for Masses of Users (Page 195)


Günnemann, Stephan, Technical University of Munich

Robust Spectral Clustering for Noisy Data: Modeling Sparse Corruptions Improves Latent Embeddings (Page 737)


Guo, Wenbo, The Pennsylvania State University

Adversary Resistant Deep Neural Networks with an Application to Malware Detection (Page 1145)


Gupta, Chetan, Hitachi America Ltd

Dispatch with Confidence: Integration of Machine Learning, Optimization and Simulation for Open Pit Mines (Page 1981)


Gupta, Somit, Microsoft Corporation

A Dirty Dozen: Twelve Common Metric Interpretation Pitfalls in Online Controlled Experiments (Page 1427)


Guttag, John V., Massachusetts Institute of Technology

Bolt: Accelerated Data Mining with Fast Vector Compression (Page 727)

Predicting Clinical Outcomes Across Changing Electronic Health Record Systems (Page 1497)


Haines, Andrew, Yahoo Research

Interpretable Predictions of Tree-based Ensembles via Actionable Feature Tweaking (Page 465)


Hallac, David, Stanford University

Network Inference via the Time-Varying Graphical Lasso (Page 205)

Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data (Page 215)


Halpern, Yoni, Google Inc.

Learning to Count Mosquitoes for the Sterile Insect Technique (Page 1943)


Haltom, William, University of Minnesota

Tripoles: A New Class of Relationships in Time Series Data (Page 697)


Han, Jiawei, University of Illinois at Urbana-Champaign

Automatic Synonym Discovery with Knowledge Bases (Page 997)

Bridging Collaborative Filtering and Semi-Supervised Learning: A Neural Approach for POI Recommendation (Page 1245)

MetaPAD: Meta Pattern Discovery from Massive Text Corpora (Page 877)

PReP: Path-Based Relevance from a Probabilistic Perspective in Heterogeneous Information Networks (Page 425)

TrioVecEvent: Embedding-Based Online Local Event Detection in Geo-Tagged Tweet Streams (Page 595)


Han, Wook-Shin, POSTECH

PAMAE: Parallel k-Medoids Clustering with High Accuracy and Efficiency (Page 1087)


Hanratty, Tim, U.S. Army Research Lab

TrioVecEvent: Embedding-Based Online Local Event Detection in Geo-Tagged Tweet Streams (Page 595)

MetaPAD: Meta Pattern Discovery from Massive Text Corpora (Page 877)


Haque, Zakaria, Google, Inc.

TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks (Page 1763)

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


Harada, Kunihiko, Hitachi America Ltd

Dispatch with Confidence: Integration of Machine Learning, Optimization and Simulation for Open Pit Mines (Page 1981)


Hartley, Jeffrey, General Motors

Deep Design: Product Aesthetics for Heterogeneous Markets (Page 1961)


Hassan, Naeemul, University of Mississippi

Toward Automated Fact-Checking: Detecting Check-worthy Factual Claims by ClaimBuster (Page 1803)


Haykal, Salem, Google Inc.

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


He, Caifeng, Huawei Technologies

An Intelligent Customer Care Assistant System for Large-Scale Cellular Network Diagnosis (Page 1951)


He, Cheng, Huawei Technologies

An Intelligent Customer Care Assistant System for Large-Scale Cellular Network Diagnosis (Page 1951)

Extremely Fast Decision Tree Mining for Evolving Data Streams (Page 1733)


He, Daqing, University of Pittsburgh

Semi-Supervised Techniques for Mining Learning Outcomes and Prerequisites (Page 907)


He, Jingrui, Arizona State University

A Local Algorithm for Structure-Preserving Graph Cut (Page 655)

Local Algorithm for User Action Prediction Towards Display Ads (Page 2091)

Multi-task Function-on-function Regression with Co-grouping Structured Sparsity (Page 1255)


He, Junxian, Carnegie Mellon University & Shanghai Jiao Tong University

Efficient Correlated Topic Modeling with Topic Embedding (Page 225)


He, Lifang, Shenzhen University

Structural Deep Brain Network Mining (Page 475)


He, Tianfu, Harbin Institute of Technology

Planning Bike Lanes based on Sharing-Bikes' Trajectories (Page 1377)


He, Yeye, Microsoft Research

Discovering Enterprise Concepts Using Spreadsheet Tables (Page 1873)


Heck, Larry, Google

The Future of Artificially Intelligent Assistants (Page 33)


Hill, Daniel N., Amazon.com, Inc.

An Efficient Bandit Algorithm for Realtime Multivariate Optimization (Page 1813)


Hillygus, Sunshine, Duke University

Evaluating U.S. Electoral Representation with a Joint Statistical Model of Congressional Roll-Calls, Legislative Text, and Voter Registration Data (Page 1205)


Ho, Qirong, Petuum, Inc.

Distributed Multi-Task Relationship Learning (Page 937)


Holmes, Geoff, University of Waikato

Extremely Fast Decision Tree Mining for Evolving Data Streams (Page 1733)


Hong, Liangjie, Etsy Inc.

On Sampling Strategies for Neural Network-based Collaborative Filtering (Page 767)


Hong, Lichan, Google, Inc.

TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks (Page 1763)


Hooi, Bryan, Carnegie Mellon University

DenseAlert: Incremental Dense-Subtensor Detection in Tensor Streams (Page 1057)


Hope, Tom, Hebrew University of Jerusalem

Accelerating Innovation Through Analogy Mining (Page 235)


Hosseini, Seyed Abbas, Sharif University of Technology

Recurrent Poisson Factorization for Temporal Recommendation (Page 847)


Hou, Shifu, West Virginia University

HinDroid: An Intelligent Android Malware Detection System Based on Structured Heterogeneous Information Network (Page 1507)


How, Jonathan P., Massachusetts Institute of Technology

Planning and Learning under Uncertainty: Theory and Practice (Page 19)


Howell, Patrick, Under Armour Connected Fitness

Matching Restaurant Menus to Crowdsourced Food Data: A Scalable Machine Learning Approach (Page 2001)


Hsieh, Cho-Jui, University of California, Davis

Communication-Efficient Distributed Block Minimization for Nonlinear Kernel Machines (Page 245)

Large-scale Collaborative Ranking in Near-Linear Time (Page 515)


Hu, Tao, Didi Research Institute, Didi Chuxing

A Taxi Order Dispatch Model based On Combinatorial Optimization (Page 2151)


Hu, Wenqing, Missouri University of Science and Technology

Human Mobility Synchronization and Trip Purpose Detection with Mixture of Hawkes Processes (Page 495)


Hu, Xia, Texas A&M University & Texas A&M Engineering Experiment Station

Multi-Aspect Streaming Tensor Completion (Page 435)


Hu, Zhiting, Carnegie Mellon University & Petuum Inc.

Efficient Correlated Topic Modeling with Topic Embedding (Page 225)


Huan, Jun, University of Kansas

Constructivism Learning: A Learning Paradigm for Transparent Predictive Analytics (Page 285)

Sparse Compositional Local Metric Learning (Page 1097)


Huang, Danny Yuxing, University of California, San Diego

Backpage and Bitcoin: Uncovering Human Traffickers (Page 1595)


Huang, Heng, University of Texas at Arlington

Groups-Keeping Solution Path Algorithm for Sparse Regression with Automatic Feature Grouping (Page 185)


Huang, Jimmy Xiangji, York University

A Data Mining Framework for Valuing Large Portfolios of Variable Annuities (Page 1467)


Huang, Liang-Hao, Academia Sinica

On Finding Socially Tenuous Groups for Online Social Networks (Page 415)


Huang, Po-Sen, Microsoft Research

ReasoNet: Learning to Stop Reading in Machine Comprehension (Page 1047)


Huang, Qiming, Purdue University

SPOT: Sparse Optimal Transformations for High Dimensional Variable Selection and Exploratory Regression Analysis (Page 857)


Huang, Xiangru, University of Texas at Austin

PPDsparse: A Parallel Primal-Dual Sparse Method for Extreme Classification (Page 545)


Huang, Xiao, Texas A&M University

Multi-Aspect Streaming Tensor Completion (Page 435)


Huang, Ying, Shanghai Jiao Tong University

Efficient Correlated Topic Modeling with Topic Embedding (Page 225)


Huang, Yun, University of Pittsburgh

Semi-Supervised Techniques for Mining Learning Outcomes and Prerequisites (Page 907)


Huq, Aziz, University of Chicago

Algorithmic Decision Making and the Cost of Fairness (Page 797)


Ioannidis, Stratis, Northeastern University

PNP: Fast Path Ensemble Method for Movie Design (Page 1527)


Iosifidis, Vasileios, Leibniz University Hanover & L3S Research Center

Large Scale Sentiment Learning with Limited Labels (Page 1823)


Ishihata, Masakazu, Hokkaido University

Statistical Emerging Pattern Mining with Multiple Testing Correction (Page 897)


Ispir, Mustafa, Google, Inc.

TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks (Page 1763)

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


Isukapalli, Sastry, AT&T Labs

AESOP: Automatic Policy Learning for Predicting and Mitigating Network Service Impairments (Page 1783)


Ito, Shinji, NEC Corporation

Optimization Beyond Prediction: Prescriptive Price Optimization (Page 1833)


Iwayama, Makoto, Hitachi, Ltd.

Learning to Generate Rock Descriptions from Multivariate Well Logs with Hierarchical Attention (Page 2031)


Iyer, Anand, Amazon.com, Inc.

An Efficient Bandit Algorithm for Realtime Multivariate Optimization (Page 1813)


Jain, Anil K., Michigan State University

Patient Subtyping via Time-Aware LSTM Networks (Page 65)


Jain, Vihan, Google Inc.

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


Jaiswal, Mamta, University of Michigan

Contextual Motifs: Increasing the Utility of Motifs using Contextual Data (Page 155)


Janakiraman, Vijay Manikandan, USRA / NASA Ames Research Center

Finding Precursors to Anomalous Drop in Airspeed During a Flight's Takeoff (Page 1843)


Janga, Satyanarayana Reddy, AOL Platforms

Ad Serving with Multiple KPIs (Page 1853)


Jauvion, Gregoire, AlephD

Real-Time Optimization of Web Publisher RTB Revenues (Page 1743)


Javed, Faizan, CareerBuilder LLC

Supporting Employer Name Normalization at both Entity and Cluster Level (Page 1883)


Jewell, Adria, comScore

Internet Device Graphs (Page 1913)


Ji, Shuiwang, Washington State University

Multi-Modality Disease Modeling via Collective Deep Matrix Factorization (Page 1155)


Jia, Xiaowei, University of Minnesota

Incremental Dual-memory LSTM in Land Cover Prediction (Page 867)


Jiang, Feijun, Alibaba Group

A Hybrid Framework for Text Modeling with Convolutional RNN (Page 2061)


Jiang, Meng, University of Notre Dame

Estimating Treatment Effect in the Wild via Differentiated Confounder Balancing (Page 265)

MetaPAD: Meta Pattern Discovery from Massive Text Corpora (Page 877)


Jiang, Xiaoqian, University of California, San Diego

Federated Tensor Factorization for Computational Phenotyping (Page 887)


Jiang, Yexi, Florida International University

FLAP: An End-to-End Event Log Analysis Platform for System Management (Page 1547)


Jin, Junqi, Alibaba Group

Optimized Cost per Click in Taobao Display Advertising (Page 2191)


Joachims, Thorsten, Cornell University

Effective Evaluation Using Logged Bandit Feedback from Multiple Loggers (Page 687)


Johari, Ramesh, Stanford University

Peeking at A/B Tests: Why it matters, and what to do about it (Page 1517)


Johnson, Reid A., University of Notre Dame

Structural Diversity and Homophily: A Study Across More Than One Hundred Big Networks (Page 807)


Joshi, Ankita, University of Georgia & CareerBuilder LLC

Supporting Employer Name Normalization at both Entity and Cluster Level (Page 1883)

 

(Return to Top to Navigate the KDD'17 Author Index)