Main Page

Table of Contents

Author Index

Author Index


(Return to Top)

Kadav, Asim, NEC Laboratories America

A Context-aware Attention Network for Interactive Question Answering (Page 927)


Kakar, Tabassum, Worcester Polytechnic Institute

MARAS: Signaling Multi-Drug Adverse Reactions (Page 1615)


(Return to Top)

Kale, Ajinkya, eBay Inc.

Visual Search at eBay (Page 2101)


Kale, David C., University of Southern California

Collecting and Analyzing Millions of mHealth Data Streams (Page 1971)


Kan, Reuben, Google Australia

Quick Access: Building a Smart Experience for Google Drive (Page 1643)


Kanchan, Vishal, Entrupy Inc

The Fake vs Real Goods Problem: Microscopy and Machine Learning to the Rescue (Page 2011)


Kaplan, Lance M., Army Research Laboratory

MetaPAD: Meta Pattern Discovery from Massive Text Corpora (Page 877)


(Return to Top)

Karpatne, Anuj, University of Minnesota

Big Data in Climate: Opportunities and Challenges for Machine Learning (Page 21)

Tripoles: A New Class of Relationships in Time Series Data (Page 697)


Karro, John, Google Research

Google Vizier: A Service for Black-Box Optimization (Page 1487)


Kaski, Samuel, Aalto University

Convex Factorization Machine for Toxicogenomics Prediction (Page 1215)


Katzir, Liran, Technion

A Minimal Variance Estimator for the Cardinality of Big Data Set Intersection (Page 95)


Kaur, Jasleen, Philips Lighting Research

Using Convolutional Networks and Satellite Imagery to Identify Patterns in Urban Environments at a Large Scale (Page 1357)


Kenthapadi, Krishnaram, LinkedIn Corporation

LiJAR: A System for Job Application Redistribution towards Efficient Career Marketplace (Page 1397)


(Return to Top)

Keogh, Eamonn, University of California Riverside

Matrix Profile V: A Generic Technique to Incorporate Domain Knowledge into Motif Discovery (Page 125)


Keren, Daniel, Haifa University

Anarchists, Unite: Practical Entropy Approximation for Distributed Streams (Page 837)


Khan, Suleiman A., University of Helsinki

Convex Factorization Machine for Toxicogenomics Prediction (Page 1215)


Khandelwal, Ankush, University of Minnesota

Incremental Dual-memory LSTM in Land Cover Prediction (Page 867)


Khodadadi, Ali, Sharif University of Technology

Recurrent Poisson Factorization for Temporal Recommendation (Page 847)


Kiapour, Hadi, eBay Inc.

Visual Search at eBay (Page 2101)


(Return to Top)

Kim, Dong Woo, Microsoft Corporation

A Dirty Dozen: Twelve Common Metric Interpretation Pitfalls in Online Controlled Experiments (Page 1427)


Kim, Jisu, Carnegie Mellon University

DenseAlert: Incremental Dense-Subtensor Detection in Tensor Streams (Page 1057)


Kim, Yejin, Pohang University of Science and Technology & University of California, San Diego

Federated Tensor Factorization for Computational Phenotyping (Page 887)


Kitts, Brendan, PrecisionDemand

Ad Serving with Multiple KPIs (Page 1853)


Kittur, Aniket, Carnegie Mellon University

Accelerating Innovation Through Analogy Mining (Page 235)


Kiwaki, Taichi, The University of Tokyo

Multi-view Learning over Retinal Thickness and Visual Sensitivity on Glaucomatous Eyes (Page 2041)


(Return to Top)

Kleinberg, Jon, Cornell University

The Selective Labels Problem: Evaluating Algorithmic Predictions in the Presence of Unobservables (Page 275)


Klinkigt, Martin, Hitachi, Ltd.

Learning to Generate Rock Descriptions from Multivariate Well Logs with Hierarchical Attention (Page 2031)


Kobayashi, Yoshiyuki, Hitachi, Ltd.

Learning to Generate Rock Descriptions from Multivariate Well Logs with Hierarchical Attention (Page 2031)


Kobren, Ari, University of Massachusetts, Amherst

A Hierarchical Algorithm for Extreme Clustering (Page 255)


Koc, Levent, Google Inc.

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


(Return to Top)

Kochanski, Greg, Google Research

Google Vizier: A Service for Black-Box Optimization (Page 1487)


Kodra, Evan, risQ Inc.

DeepSD: Generating High Resolution Climate Change Projections through Single Image Super-Resolution (Page 1663)


Koller, Jonathan, comScore

Internet Device Graphs (Page 1913)


Komiyama, Junpei, University of Tokyo

Statistical Emerging Pattern Mining with Multiple Testing Correction (Page 897)


Koo, Chiu Yuen, Google Inc.

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


Koomen, Pete, Optimizely, Inc.

Peeking at A/B Tests: Why it matters, and what to do about it (Page 1517)


(Return to Top)

Koutra, Danai, University of Michigan

PNP: Fast Path Ensemble Method for Movie Design (Page 1527)


Krishnamurthy, Akshay, University of Massachusetts, Amherst

A Hierarchical Algorithm for Extreme Clustering (Page 255)


Krishnan, Dilip, Google Inc.

Learning to Count Mosquitoes for the Sterile Insect Technique (Page 1943)


Krishnan, Michael, Adap.tv

Ad Serving with Multiple KPIs (Page 1853)


Kuang, Kun, Tsinghua University

Estimating Treatment Effect in the Wild via Differentiated Confounder Balancing (Page 265)


Kuang, Zhaobin, University of Wisconsin-Madison

Pharmacovigilance via Baseline Regularization with Large-Scale Longitudinal Observational Data (Page 1537)


(Return to Top)

Kulhman, Caitlin, Worcester Polytechnic Institute

Distributed Local Outlier Detection in Big Data (Page 1225)


Kumar, Vipin, University of Minnesota

Big Data in Climate: Opportunities and Challenges for Machine Learning (Page 21)

Incremental Dual-memory LSTM in Land Cover Prediction (Page 867)

Tripoles: A New Class of Relationships in Time Series Data (Page 697)


Labutov, Igor, Carnegie Mellon University

Semi-Supervised Techniques for Mining Learning Outcomes and Prerequisites (Page 907)


Lakkaraju, Himabindu, Stanford University

The Selective Labels Problem: Evaluating Algorithmic Predictions in the Presence of Unobservables (Page 275)


Lalmas, Mounia, Yahoo Research

Interpretable Predictions of Tree-based Ensembles via Actionable Feature Tweaking (Page 465)


Largouet, Christine, IRISA

Anomaly Detection in Streams with Extreme Value Theory (Page 1067)


(Return to Top)

Larus-Stone, Nicholas, Harvard University

Learning Certifiably Optimal Rule Lists (Page 35)


Lattanzi, Silvio, Google Research Zurich

Ego-Splitting Framework: from Non-Overlapping to Overlapping Clusters (Page 145)


Lauvaux, Thomas, Pennsylvania State University

Contextual Spatial Outlier Detection with Metric Learning (Page 2161)


Lee, Chul, Under Armour Connected Fitnes

Matching Restaurant Menus to Crowdsourced Food Data: A Scalable Machine Learning Approach (Page 2001)


Lee, Dik Lun, Hong Kong University of Science and Technology

Meta-Graph Based Recommendation Fusion over Heterogeneous Information Networks (Page 635)


Lee, Jae-Gil, Korea Advanced Institute of Science and Technology

PAMAE: Parallel k-Medoids Clustering with High Accuracy and Efficiency (Page 1087)


(Return to Top)

Lee, Jeong-Yoon, Microsoft

Benchmarks and Process Management in Data Science: Will We Ever Get Over the Mess? (Page 31)


Lee, Patrick P. C., Chinese University of Hong Kong

An Intelligent Customer Care Assistant System for Large-Scale Cellular Network Diagnosis (Page 1951)


Lee, Wang-Chien, The Pennsylvania State University

On Finding Socially Tenuous Groups for Online Social Networks (Page 415)


Lei, Dongming, University of Illinois at Urbana-Champaign

TrioVecEvent: Embedding-Based Online Local Event Detection in Geo-Tagged Tweet Streams (Page 595)


Leow, Alex D., University of Illinois at Chicago

DeepMood: Modeling Mobile Phone Typing Dynamics for Mood Detection (Page 747)


Leskovec, Jure, Stanford University

Local Higher-Order Graph Clustering (Page 555)

Network Inference via the Time-Varying Graphical Lasso (Page 205)

The Selective Labels Problem: Evaluating Algorithmic Predictions in the Presence of Unobservables (Page 275)

Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data (Page 215)


(Return to Top)

Lew, Lukasz, Google Inc.

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


Lewis, Bryan L., Virginia Tech

GELL: Automatic Extraction of Epidemiological Line Lists from Open Sources (Page 1477)


Li, Beibei, Carnegie Mellon University

A Quasi-experimental Estimate of the Impact of P2P Transportation Platforms on Urban Consumer Patterns (Page 1683)


Li, Bo, Tsinghua Univeristy

Estimating Treatment Effect in the Wild via Differentiated Confounder Balancing (Page 265)


Li, Chengkai, University of Texas at Arlington

Toward Automated Fact-Checking: Detecting Check-worthy Factual Claims by ClaimBuster (Page 1803)


Li, Feifei, University of Utah

Compass: Spatio Temporal Sentiment Analysis of US Election: What Twitter Says! (Page 1585)


(Return to Top)

Li, Han, Alibaba Group

Optimized Cost per Click in Taobao Display Advertising (Page 2191)


Li, Huayu, University of North Carolina, Charlotte

A Context-aware Attention Network for Interactive Question Answering (Page 927)

Prospecting the Career Development of Talents: A Survival Analysis Perspective (Page 917)

Tracking the Dynamics in Crowdfunding (Page 625)


Li, Jianda, Hong Kong University of Science and Technology

Meta-Graph Based Recommendation Fusion over Heterogeneous Information Networks (Page 635)


Li, Jundong, Arizona State University

Unsupervised Feature Selection in Signed Social Networks (Page 777)


Li, Keqian, University of California, Santa Barbara & Microsoft Research

Discovering Enterprise Concepts Using Spreadsheet Tables (Page 1873)


Li, Liangyue, Arizona State University

Is the Whole Greater Than the Sum of Its Parts? (Page 295)


(Return to Top)

Li, Longfei, Ant Financial Services Group

KunPeng: Parameter Server based Distributed Learning Systems and Its Applications in Alibaba and Ant Financial (Page 1693)


Li, Ping, Rutgers University

Linearized GMM Kernels and Normalized Random Fourier Features (Page 315)


Li, Qi, SUNY Buffalo

Unsupervised Discovery of Drug Side-Effects from Heterogeneous Data Sources (Page 967)


Li, Qiao, Rutgers University

Functional Zone Based Hierarchical Demand Prediction For Bike System Expansion (Page 957)


Li, Tao, Nanjing University of Posts and Telecommunications

FLAP: An End-to-End Event Log Analysis Platform for System Management (Page 1547)

STAR: A System for Ticket Analysis and Resolution (Page 2181)


Li, Xiaoli, University of Kansas

Constructivism Learning: A Learning Paradigm for Transparent Predictive Analytics (Page 285)


(Return to Top)

Li, Xiaolong, Ant Financial Services Group

KunPeng: Parameter Server based Distributed Learning Systems and Its Applications in Alibaba and Ant Financial (Page 1693)


Li, Xiaopeng, Hong Kong University of Science and Technology

Collaborative Variational Autoencoder for Recommender Systems (Page 305)


Li, Xiucheng, Nanyang Technological University

Discovering Pollution Sources and Propagation Patterns in Urban Area (Page 1863)


Li, Yaliang, SUNY at Buffalo & Baidu Research Big Data Lab

Collaboratively Improving Topic Discovery and Word Embeddings by Coordinating Global and Local Contexts (Page 535)


Li, Yanhua, WPI

Planning Bike Lanes based on Sharing-Bikes' Trajectories (Page 1377)


Li, Zhenguo, Huawei Noah's Ark Lab

Graph Edge Partitioning via Neighborhood Heuristic (Page 605)


(Return to Top)

Li, Zhenhui, Pennsylvania State University

Contextual Spatial Outlier Detection with Metric Learning (Page 2161)

Structural Event Detection from Log Messages (Page 1175)


Lian, Defu, University of Electronic Science and Technology of China

Discrete Content-aware Matrix Factorization (Page 325)


Lian, Wenzhao, Vicarious

Convex Factorization Machine for Toxicogenomics Prediction (Page 1215)


Liess, Stefan, Univertsity of Minnesota

Tripoles: A New Class of Relationships in Time Series Data (Page 697)


Lin, Kaixiang, Michigan State University

Privacy-Preserving Distributed Multi-Task Learning with Asynchronous Updates (Page 1195)


Liu, Bing, University of Illinois at Chicago (UIC)

Aspect Based Recommendations: Recommending Items with the Most Valuable Aspects Based on User Reviews (Page 717)


(Return to Top)

Liu, C.H. Bryan, ASOS.com

Customer Lifetime Value Prediction Using Embeddings (Page 1753)


Liu, Chuanren, Drexel University

Point-of-Interest Demand Modeling with Human Mobility Patterns (Page 947)

Randomization or Condensation? Linear-Cost Matrix Sketching Via Cascaded Compression Sampling (Page 615)


Liu, Guannan, Beihang University

Human Mobility Synchronization and Trip Purpose Detection with Mixture of Hawkes Processes (Page 495)

Unsupervised P2P Rental Recommendations via Integer Programming (Page 165)


Liu, Guodong, University of Texas at Arlington

Groups-Keeping Solution Path Algorithm for Sparse Regression with Automatic Feature Grouping (Page 185)


Liu, Huan, Arizona State University

Randomized Feature Engineering as a Fast and Accurate Alternative to Kernel Methods (Page 485)

Unsupervised Feature Selection in Signed Social Networks (Page 777)


Liu, Junming, Rutgers University

Effective and Real-time In-App Activity Analysis in Encrypted Internet Traffic Streams (Page 335)

Functional Zone Based Hierarchical Demand Prediction For Bike System Expansion (Page 957)


(Return to Top)

Liu, Liyuan, University of Illinois at Urbana-Champaign

TrioVecEvent: Embedding-Based Online Local Event Detection in Geo-Tagged Tweet Streams (Page 595)


Liu, Qi, University of Science and Technology of China

Tracking the Dynamics in Crowdfunding (Page 625)


Liu, Qiaoling, CareerBuilder LLC

Supporting Employer Name Normalization at both Entity and Cluster Level (Page 1883)


Liu, Qin, Huawei Noah's Ark Lab & Chinese University of Hong Kong

Graph Edge Partitioning via Neighborhood Heuristic (Page 605)


Liu, Rui, University of Electronic Science and Technology of China

Discrete Content-aware Matrix Factorization (Page 325)


Liu, Shichen, Alibaba Group

Cascade Ranking for Operational E-commerce Search (Page 1557)


(Return to Top)

Liu, Sulin, Nanyang Technological University, Singapore

Distributed Multi-Task Relationship Learning (Page 937)


Liu, Xue, McGill University

Adversary Resistant Deep Neural Networks with an Application to Malware Detection (Page 1145)


Liu, Yanchi, Rutgers University

Functional Zone Based Hierarchical Demand Prediction For Bike System Expansion (Page 957)

Point-of-Interest Demand Modeling with Human Mobility Patterns (Page 947)


Liu, Yi, Amazon.com, Inc.

An Efficient Bandit Algorithm for Realtime Multivariate Optimization (Page 1813)


Liu, Zheng, Nanjing University of Posts and Telecommunications

FLAP: An End-to-End Event Log Analysis Platform for System Management (Page 1547)

STAR: A System for Ticket Analysis and Resolution (Page 2181)


Livni, Josh, Verily Inc.

Learning to Count Mosquitoes for the Sterile Insect Technique (Page 1943)


(Return to Top)

Long, James, Google Research

A Practical Algorithm for Solving the Incoherence Problem of Topic Models In Industrial Applications (Page 1713)


Lotker, Zvi, Ben Gurion University of the Negev

Improved Degree Bounds and Full Spectrum Power Laws in Preferential Attachment Networks (Page 45)


Lou, Yin, Airbnb Incorporation

BDT: Gradient Boosted Decision Tables for High Accuracy and Scoring Efficiency (Page 1893)


Lu, Chun-Ta, University of Illinois at Chicago

Structural Deep Brain Network Mining (Page 475)


Lu, Jian, Nanjing University

HoORaYs: High-order Optimization of Rating Distance for Recommender Systems (Page 525)


Lu, Xinjiang, Northwestern Polytechnical University

Point-of-Interest Demand Modeling with Human Mobility Patterns (Page 947)


(Return to Top)

Lucey, Patrick, STATS

"Not All Passes Are Created Equal:" Objectively Measuring The Risk and Reward of Passes in Soccer from Tracking Data (Page 1605)

"The Leicester City Fairytale?": Utilizing New Soccer Analytics Tools to Compare Performance in the 15/16 & 16/17 EPL Seasons (Page 1991)


Ludwig, Jens, University of Chicago

The Selective Labels Problem: Evaluating Algorithmic Predictions in the Presence of Unobservables (Page 275)


Luo, Jiebo, University of Rochester

Mixture Factorized Ornstein-Uhlenbeck Processes for Time-Series Forecasting (Page 987)


Luo, Ping, Chinese Academy of Sciences & University of Chinese Academy of Sciences

Small Batch or Large Batch? Gaussian Walk with Rebound Can Teach (Page 1275)


Luo, Tingjin, National University of Defense Technology

Functional Annotation of Human Protein Coding Isoforms via Non-convex Multi-Instance Learning (Page 345)


Lv, Weifeng, Beihang University

The Simpler The Better: A Unified Approach to Predicting Original Taxi Demands based on Large-Scale Online Platforms (Page 1653)


(Return to Top)

Ma, Fenglong, SUNY Buffalo & Xerox

Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks (Page 1903)

Unsupervised Discovery of Drug Side-Effects from Heterogeneous Data Sources (Page 967)


Ma, Hao, Microsoft Research

A Century of Science: Globalization of Scientific Collaborations, Citations, and Innovations (Page 1437)


Maclin, Richard, University of Minnesota-Duluth

Pharmacovigilance via Baseline Regularization with Large-Scale Longitudinal Observational Data (Page 1537)


Mah, Alexandre, Google Australia

Quick Access: Building a Smart Experience for Google Drive (Page 1643)


Majumder, Maimuna S., Massachusetts Institute of Technology & Boston Children's Hospital

GELL: Automatic Extraction of Epidemiological Line Lists from Open Sources (Page 1477)


Malloy, Matthew, comScore

Internet Device Graphs (Page 1913)


(Return to Top)

Mamitsuka, Hiroshi, Kyoto University & Aalto University

Convex Factorization Machine for Toxicogenomics Prediction (Page 1215)


Mandros, Panagiotis, Max Planck Institute for Informatics & Saarland University

Discovering Reliable Approximate Functional Dependencies (Page 355)


Manzoor, Emaad, Carnegie Mellon University

RUSH! Targeted Time-limited Coupons via Purchase Forecasts (Page 1923)


Mao, JC, Microsoft Corporation

Deep Embedding Forest: Forest-based Serving with Deep Embedding Features (Page 1703)


Marathe, Madhav V., Virginia Tech

GELL: Automatic Extraction of Epidemiological Line Lists from Open Sources (Page 1477)


Markopoulou, Athina, University of California, Irvine

Construction of Directed 2K Graphs (Page 1115)


(Return to Top)

Marlin, Benjamin M., University of Massachusetts, Amherst

Learning Tree-Structured Detection Cascades for Heterogeneous Networks of Embedded Devices (Page 1773)


Marsic, Ivan, Rutgers University

A Data-driven Process Recommender Framework (Page 2111)


Matkovic, Yves, Technical University of Munich

Robust Spectral Clustering for Noisy Data: Modeling Sparse Corruptions Improves Latent Embeddings (Page 737)


Matthews, Bryan, USRA/ NASA Ames Research Center

Finding Precursors to Anomalous Drop in Airspeed During a Flight's Takeoff (Page 1843)


Matwin, Stan, Dalhousie University

Chairs' Welcome Message


Maurus, Samuel, Technical University of Munich

Let's See Your Digits: Anomalous-State Detection using Benford's Law (Page 977)


(Return to Top)

Mautz, Dominik, Ludwig-Maximilians-Universität München

Learning from Labeled and Unlabeled Vertices in Networks (Page 1265)

Towards an Optimal Subspace for K-Means (Page 365)


Mayfield, Elijah, Turnitin

Formative Essay Feedback Using Predictive Scoring Models (Page 2071)


Mazumdar, Mainak, Nielsen

Addressing Challenges with Big Data for Media Measurement (Page 23)


McCallum, Andrew, University of Massachusetts, Amherst

A Hierarchical Algorithm for Extreme Clustering (Page 255)


McCoy, Damon, New York University

Backpage and Bitcoin: Uncovering Human Traffickers (Page 1595)


McGrew, David, Cisco Systems, Inc.

Machine Learning for Encrypted Malware Traffic Classification: Accounting for Noisy Labels and Non-Stationarity (Page 1723)


(Return to Top)

McGuirk, Anya, SAS Institute Inc.

Prognosis and Diagnosis of Parkinson's Disease Using Multi-Task Learning (Page 1457)


McNamara, Quinten, University of Texas at Austin

Developing a Comprehensive Framework for Multimodal Feature Extraction (Page 1567)


Mei, Qiaozhu, University of Michigan

End-to-end Learning for Short Text Expansion (Page 1105)


Meng, Chuishi, SUNY Buffalo

Unsupervised Discovery of Drug Side-Effects from Heterogeneous Data Sources (Page 967)


Mewald, Clemens, Google, Inc

TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks (Page 1763)

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


Meyer, Cayden, Google Australia

Quick Access: Building a Smart Experience for Google Drive (Page 1643)


(Return to Top)

Michaelis, Andrew, University Corporation, Monterey Bay

DeepSD: Generating High Resolution Climate Change Projections through Single Image Super-Resolution (Page 1663)


Miel, Shayne, Turnitin

Formative Essay Feedback Using Predictive Scoring Models (Page 2071)


Miller, Renée J., University of Toronto

The Future of Data Integration (Page 3)


Min, Martin Renqiang, NEC Laboratories America

A Context-aware Attention Network for Interactive Question Answering (Page 927)


Min, Yue, Didi Research Institute, Didi Chuxing

A Taxi Order Dispatch Model based On Combinatorial Optimization (Page 2151)


Minato, Shin-ichi, Hokkaido University

Statistical Emerging Pattern Mining with Multiple Testing Correction (Page 897)


(Return to Top)

Ming, Jingci, Rutgers University

Effective and Real-time In-App Activity Analysis in Encrypted Internet Traffic Streams (Page 335)

Functional Zone Based Hierarchical Demand Prediction For Bike System Expansion (Page 957)


Modi, Akshay Naresh, Google Inc.

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


Moitra, Subhodeep, Google Research

Google Vizier: A Service for Black-Box Optimization (Page 1487)


Monath, Nicholas, University of Massachusetts, Amherst

A Hierarchical Algorithm for Extreme Clustering (Page 255)


Monreale, Anna, University of Pisa

Clustering Individual Transactional Data for Masses of Users (Page 195)


Mordente, Caterina, Be Think Solve Execute

Fast Enumeration of Large k-Plexes (Page 115)


(Return to Top)

Morino, Kai, The University of Tokyo

Multi-view Learning over Retinal Thickness and Visual Sensitivity on Glaucomatous Eyes (Page 2041)


Mottini, Alejandro, Amadeus SAS

Deep Choice Model Using Pointer Networks for Airline Itinerary Prediction (Page 1575)


Mukherjee, Animesh, IIT Kharagpur

Relay-Linking Models for Prominence and Obsolescence in Evolving Networks (Page 1077)


Mullainathan, Sendhil, Harvard University

The Selective Labels Problem: Evaluating Algorithmic Predictions in the Presence of Unobservables (Page 275)


Murata, Hiroshi, The University of Tokyo

Multi-view Learning over Retinal Thickness and Visual Sensitivity on Glaucomatous Eyes (Page 2041)


Muthukrishnan, Muthu, Rutgers University

The Future of Artificially Intelligent Assistants (Page 33)


(Return to Top)

Nahum, Yinon, Weizmann Institute of Science

Improved Degree Bounds and Full Spectrum Power Laws in Preferential Attachment Networks (Page 45)


Najork, Marc, Google USA

Quick Access: Building a Smart Experience for Google Drive (Page 1643)


Nakamura, Taiga, IBM Almaden Research Center

Small Batch or Large Batch? Gaussian Walk with Rebound Can Teach (Page 1275)


Nanni, Mirco, ISTI-CNR

Clustering Individual Transactional Data for Masses of Users (Page 195)


Nassif, Houssam, Amazon.com, Inc.

An Efficient Bandit Algorithm for Realtime Multivariate Optimization (Page 1813)


Naumann, Tristan, Massachusetts Institute of Technology

Predicting Clinical Outcomes Across Changing Electronic Health Record Systems (Page 1497)


(Return to Top)

Nayak, Guruprasad, University of Minnesota

Incremental Dual-memory LSTM in Land Cover Prediction (Page 867)


Nemani, Ramakrishna, NASA Advanced Supercomputing Division / NASA Ames Research Center

DeepSD: Generating High Resolution Climate Change Projections through Single Image Super-Resolution (Page 1663)


Newburger, Daniel, Verily Inc.

Learning to Count Mosquitoes for the Sterile Insect Technique (Page 1943)


Ngiam, Kee Yuan, National University Health System

Resolving the Bias in Electronic Medical Records (Page 2171)


Nicholas, Charles, University of Maryland, Baltimore County

An Alternative to NCD for Large Sequences, Lempel-Ziv Jaccard Distance (Page 1007)


Nishibayashi, Takashi, VOYAGE GROUP, Inc.

Statistical Emerging Pattern Mining with Multiple Testing Correction (Page 897)


(Return to Top)

Ntoutsi, Eirini, Leibniz University Hanover & L3S Research Center

Large Scale Sentiment Learning with Limited Labels (Page 1823)


Obukhov, Mikhail, LinkedIn Corporation

BDT: Gradient Boosted Decision Tables for High Accuracy and Scoring Efficiency (Page 1893)


Okura, Shumpei, Yahoo Japan Corporation

Embedding-based News Recommendation for Millions of Users (Page 1933)


Ono, Shingo, Yahoo Japan Corporation

Embedding-based News Recommendation for Millions of Users (Page 1933)


Ooi, Beng Chin, National University of Singapore

Resolving the Bias in Electronic Medical Records (Page 2171)


Ororbia II, Alexander G., The Pennsylvania State University

Adversary Resistant Deep Neural Networks with an Application to Malware Detection (Page 1145)


(Return to Top)

Ou, Wenwu, Alibaba Group

Cascade Ranking for Operational E-commerce Search (Page 1557)


Ovadia, Yaniv, Google Inc.

Learning to Count Mosquitoes for the Sterile Insect Technique (Page 1943)


Oza, Nikunj, NASA Ames Research Center

Finding Precursors to Anomalous Drop in Airspeed During a Flight's Takeoff (Page 1843)


P. Saverese, Pedro H., Federal University of Rio de Janeiro

struc2vec: Learning Node Representations from Structural Identity (Page 385)


Paes Leme, Renato, Google Research

Ego-Splitting Framework: from Non-Overlapping to Overlapping Clusters (Page 145)


Paffenroth, Randy C., Worcester Polytechnic Institute

Anomaly Detection with Robust Deep Autoencoders (Page 665)


(Return to Top)

Pafka, Szilárd, Epoch

Benchmarks and Process Management in Data Science: Will We Ever Get Over the Mess? (Page 31)

Machine Learning Software in Practice: Quo Vadis? (Page 25)


Page, David, University of Wisconsin-Madison

Pharmacovigilance via Baseline Regularization with Large-Scale Longitudinal Observational Data (Page 1537)


Pagliari, Roberto, ASOS.com

Customer Lifetime Value Prediction Using Embeddings (Page 1753)


Pan, Fei, Alibaba Group

Optimized Cost per Click in Taobao Display Advertising (Page 2191)


Pan, Lujia, Huawei Technologies

An Intelligent Customer Care Assistant System for Large-Scale Cellular Network Diagnosis (Page 1951)


Pan, Sinno Jialin, Nanyang Technological University, Singapore

Distributed Multi-Task Relationship Learning (Page 937)


(Return to Top)

Pan, Yanxin, University of Michigan

Deep Design: Product Aesthetics for Heterogeneous Markets (Page 1961)


Papalambros, Panos Y., University of Michigan

Deep Design: Product Aesthetics for Heterogeneous Markets (Page 1961)


Papalexakis, Evangelos E., University of California, Riverside

SPARTan: Scalable PARAFAC2 for Large & Sparse Data (Page 375)


Parekh, Rajesh, Facebook

Designing AI at Scale to Power Everyday Life (Page 27)


Park, Youngsuk, Stanford University

Network Inference via the Time-Varying Graphical Lasso (Page 205)


Parthasarathy, Srinivasan, IBM T. J. Watson Research Center

REMIX: Automated Exploration for Interactive Outlier Detection (Page 827)

Visualizing Attributed Graphs via Terrain Metaphor (Page 1325)


(Return to Top)

Patrignani, Maurizio, Roma Tre University

Fast Enumeration of Large k-Plexes (Page 115)


Paul, Debjyoti, University of Utah

Compass: Spatio Temporal Sentiment Analysis of US Election: What Twitter Says! (Page 1585)


Pedreschi, Dino, University of Pisa

Clustering Individual Transactional Data for Masses of Users (Page 195)


Peissig, Peggy, Marshfield Clinic

Pharmacovigilance via Baseline Regularization with Large-Scale Longitudinal Observational Data (Page 1537)


Pekelis, Leonid, Optimizely, Inc.

Peeking at A/B Tests: Why it matters, and what to do about it (Page 1517)


Peleg, David, Weizmann Institute of Science

Improved Degree Bounds and Full Spectrum Power Laws in Preferential Attachment Networks (Page 45)


(Return to Top)

Perros, Ioakeim, Georgia Institute of Technology

SPARTan: Scalable PARAFAC2 for Large & Sparse Data (Page 375)


Pfahringer, Bernhard, University of Waikato

Extremely Fast Decision Tree Mining for Evolving Data Streams (Page 1733)


Phillips, Jeff M., University of Utah

Coresets for Kernel Regression (Page 645)


Pierson, Emma, Stanford University

Algorithmic Decision Making and the Cost of Fairness (Page 797)


Piramuthu, Robinson, eBay Inc.

Visual Search at eBay (Page 2101)


Piscitello, Andrea, University of Illinois at Chicago

DeepMood: Modeling Mobile Phone Typing Dynamics for Mood Detection (Page 747)


(Return to Top)

Plant, Claudia, University of Vienna

Learning from Labeled and Unlabeled Vertices in Networks (Page 1265)

Let's See Your Digits: Anomalous-State Detection using Benford's Law (Page 977)

Towards an Optimal Subspace for K-Means (Page 365)


Polosukhin, Illia, Google, Inc.

TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks (Page 1763)


Polyzotis, Neoklis, Google Inc.

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


Pop-Busui, Rodica, University of Michigan

Contextual Motifs: Increasing the Utility of Motifs using Contextual Data (Page 155)


Popescul, Alexandrin, Google USA

Quick Access: Building a Smart Experience for Google Drive (Page 1643)


Poplin, Ryan, Verily Inc.

Learning to Count Mosquitoes for the Sterile Insect Technique (Page 1943)


(Return to Top)

Porras, Phil, SRI International

Automated Categorization of Onion Sites for Analyzing the Darkweb Ecosystem (Page 1793)


Portnoff, Rebecca S., University of California, Berkeley

Backpage and Bitcoin: Uncovering Human Traffickers (Page 1595)


Potere, David, Tellus Laboratories

Spaceborne Data Enters the Mainstream (Page 29)


Potter, Andrew, AOL Platforms

Ad Serving with Multiple KPIs (Page 1853)


Pouget-Abadie, Jean, Harvard University

Detecting Network Effects: Randomizing Over Randomized Experiments (Page 1027)


Power, Paul, STATS

"Not All Passes Are Created Equal:" Objectively Measuring The Risk and Reward of Passes in Soccer from Tracking Data (Page 1605)

"The Leicester City Fairytale?": Utilizing New Soccer Analytics Tools to Compare Performance in the 15/16 & 16/17 EPL Seasons (Page 1991)


(Return to Top)

Puthenpura, Sarat, AT&T Labs

AESOP: Automatic Policy Learning for Predicting and Mitigating Network Service Impairments (Page 1783)


Qi, Guo-Jun, University of Central Florida

Mixture Factorized Ornstein-Uhlenbeck Processes for Time-Series Forecasting (Page 987)

Stock Price Prediction via Discovering Multi-Frequency Trading Patterns (Page 2141)


Qi, Yuan Alan, Ant Financial Services Group

KunPeng: Parameter Server based Distributed Learning Systems and Its Applications in Alibaba and Ant Financial (Page 1693)


Qian, Jianfeng, Columbia University

Extremely Fast Decision Tree Mining for Evolving Data Streams (Page 1733)


Qin, Xiao, Worcester Polytechnic Institute

MARAS: Signaling Multi-Drug Adverse Reactions (Page 1615)


Qiu, Shang, University of Michigan

Functional Annotation of Human Protein Coding Isoforms via Non-convex Multi-Instance Learning (Page 345)


(Return to Top)

Qu, Meng, University of Illinois at Urbana-Champaign

Automatic Synonym Discovery with Knowledge Bases (Page 997)


Quisel, Tom, Evidation Health, Inc.

Collecting and Analyzing Millions of mHealth Data Streams (Page 1971)


R. Figueiredo, Daniel, Federal University of Rio de Janeiro

struc2vec: Learning Node Representations from Structural Identity (Page 385)


R. Ribeiro, Leonardo F., Federal University of Rio de Janeiro

struc2vec: Learning Node Representations from Structural Identity (Page 385)


Rabiee, Hamid R., Sharif University of Technology

Recurrent Poisson Factorization for Temporal Recommendation (Page 847)


Raff, Edward, Laboratory for Physical Sciences

An Alternative to NCD for Large Sequences, Lempel-Ziv Jaccard Distance (Page 1007)


Ragin, Ann B., Northwestern University

Structural Deep Brain Network Mining (Page 475)


(Return to Top)

Rahmanian, Holakou, University of California, Santa Cruz

Deep Embedding Forest: Forest-based Serving with Deep Embedding Features (Page 1703)


Ramakrishnan, Naren, Virginia Tech

GELL: Automatic Extraction of Epidemiological Line Lists from Open Sources (Page 1477)


Ramesh, Sukriti, Google Inc.

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


Ratnaparkhi, Adwait, Yahoo Research

A Practical Exploration System for Search Advertising (Page 1625)


Ravi, R, Carnegie Mellon University

Post Processing Recommender Systems for Diversity (Page 707)


Ravikumar, Pradeep, Carnegie Mellon University

PPDsparse: A Parallel Primal-Dual Sparse Method for Extreme Classification (Page 545)


Ren, Kan, Shanghai Jiao Tong University

Dynamic Attention Deep Model for Article Recommendation by Learning Human Editors' Demonstration (Page 2051)


(Return to Top)

Ren, Xiang, University of Illinois at Urbana-Champaign

Automatic Synonym Discovery with Knowledge Bases (Page 997)

MetaPAD: Meta Pattern Discovery from Massive Text Corpora (Page 877)


Ren, Yong, Futurewei Tech. Inc

Effective and Real-time In-App Activity Analysis in Encrypted Internet Traffic Streams (Page 335)


Renders, Jean-Michel, XRCE

Real-Time Optimization of Web Publisher RTB Revenues (Page 1743)


Ristovski, Kosta, Hitachi America Ltd

Dispatch with Confidence: Integration of Machine Learning, Optimization and Simulation for Open Pit Mines (Page 1981)


Roumpos, Georgios, Google, Inc.

TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks (Page 1763)


Roundy, Kevin A., Symantec

Automatic Application Identification from Billions of Files (Page 2021)


(Return to Top)

Roy, Sudip, Google Inc.

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform (Page 1387)


Rozenshtein, Polina, Aalto University

Inferring the Strength of Social Ties: A Community-Driven Approach (Page 1017)


Ruan, Sijie, Xidian University

Planning Bike Lanes based on Sharing-Bikes' Trajectories (Page 1377)


Rudin, Cynthia, Duke University

Learning Certifiably Optimal Rule Lists (Page 35)

Optimized Risk Scores (Page 1125)


Ruiz, Hector, STATS

"Not All Passes Are Created Equal:" Objectively Measuring The Risk and Reward of Passes in Soccer from Tracking Data (Page 1605)

"The Leicester City Fairytale?": Utilizing New Soccer Analytics Tools to Compare Performance in the 15/16 & 16/17 EPL Seasons (Page 1991)


Rundensteiner, Elke, Worcester Polytechnic Institute

Distributed Local Outlier Detection in Big Data (Page 1225)

MARAS: Signaling Multi-Drug Adverse Reactions (Page 1615)

Scalable Top-n Local Outlier Detection (Page 1235)


Ryan, Kelly, University of Michigan

DeepMood: Modeling Mobile Phone Typing Dynamics for Mood Detection (Page 747)


(Return to Top)

Safro, Ilya, Clemson University

MOLIERE: Automatic Biomedical Hypothesis Generation System (Page 1633)


Sahu, Anshuman, Hitachi America, Ltd.

Learning to Generate Rock Descriptions from Multivariate Well Logs with Hierarchical Attention (Page 2031)


Saint-Jacques, Guillaume, Massachusetts Institute of Technology

Detecting Network Effects: Randomizing Over Randomized Experiments (Page 1027)


Salehian, Hesam, Under Armour Connected Fitnes

Matching Restaurant Menus to Crowdsourced Food Data: A Scalable Machine Learning Approach (Page 2001)


Santos Costa, Vitor, Universidade do Porto

Pharmacovigilance via Baseline Regularization with Large-Scale Longitudinal Observational Data (Page 1537)


Sarkar, Rajdeep, IIT Kharagpur

Relay-Linking Models for Prominence and Obsolescence in Evolving Networks (Page 1077)


Sathe, Saket, IBM T. J. Watson Research Center

Similarity Forests (Page 395)


(Return to Top)

Saveski, Martin, Massachusetts Institute of Technology

Detecting Network Effects: Randomizing Over Randomized Experiments (Page 1027)


Schnabel, Tobias, Cornell University

Effective Evaluation Using Logged Bandit Feedback from Multiple Loggers (Page 687)


Scholtes, Ingo, ETH Zürich

When is a Network a Network? Multi-Order Graphical Model Selection in Pathways and Temporal Networks (Page 1037)


Schuster, Assaf, Technion - Israel Institute of Technology

Anarchists, Unite: Practical Entropy Approximation for Distributed Streams (Page 837)


Schwartz, Eric, University of Michigan

A Data Science Approach to Understanding Residential Water Contamination in Flint (Page 1407)


Sculley, D, Google, Inc.

TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks (Page 1763)

Google Vizier: A Service for Black-Box Optimization (Page 1487)

Learning to Count Mosquitoes for the Sterile Insect Technique (Page 1943)


Searles, Elizabeth, Children's Healthcare Of Atlanta

SPARTan: Scalable PARAFAC2 for Large & Sparse Data (Page 375)


Seltzer, Margo, Harvard University

Learning Certifiably Optimal Rule Lists (Page 35)


(Return to Top)

Shah, Parikshit, Yahoo Research

A Practical Exploration System for Search Advertising (Page 1625)

Online Ranking with Constraints: A Primal-Dual Algorithm and Applications to Web Traffic-Shaping (Page 405)


Shahaf, Dafna, Hebrew University of Jerusalem

Accelerating Innovation Through Analogy Mining (Page 235)


Shahshahani, Ben, Yahoo Research

A Practical Exploration System for Search Advertising (Page 1625)


Shan, Ying, Microsoft Corporation

Deep Embedding Forest: Forest-based Serving with Deep Embedding Features (Page 1703)


Shang, Jingbo, University of Illinois at Urbana-Champaign

MetaPAD: Meta Pattern Discovery from Massive Text Corpora (Page 877)


Sharma, Ashlesh, Entrupy Inc

The Fake vs Real Goods Problem: Microscopy and Machine Learning to the Rescue (Page 2011)


(Return to Top)

Sharpnack, James, University of California, Davis

Large-scale Collaborative Ranking in Near-Linear Time (Page 515)


She, James, Hong Kong University of Science and Technology

Collaborative Variational Autoencoder for Recommender Systems (Page 305)


Shen, Chih-Ya, National Tsing Hua University

On Finding Socially Tenuous Groups for Online Social Networks (Page 415)


Shen, Yelong, Microsoft Research

ReasoNet: Learning to Stop Reading in Machine Comprehension (Page 1047)


Shen, Zhihong, Microsoft Research

A Century of Science: Globalization of Scientific Collaborations, Citations, and Innovations (Page 1437)


Shi, Conglei, IBM Research

Is the Whole Greater Than the Sum of Its Parts? (Page 295)


(Return to Top)

Shi, Guangsha, University of Michigan

A Data Science Approach to Understanding Residential Water Contamination in Flint (Page 1407)


Shi, Yu, University of Illinois at Urbana-Champaign

PReP: Path-Based Relevance from a Probabilistic Perspective in Heterogeneous Information Networks (Page 425)


Shi, Yue, Yahoo! Research

On Sampling Strategies for Neural Network-based Collaborative Filtering (Page 767)


Shin, Kijung, Carnegie Mellon University

DenseAlert: Incremental Dense-Subtensor Detection in Tensor Streams (Page 1057)


Shrivastava, Anshumali, Rice University

Scalable and Sustainable Deep Learning via Randomized Hashing (Page 445)


Shtutman, Michael, University of South Carolina

MOLIERE: Automatic Biomedical Hypothesis Generation System (Page 1633)


(Return to Top)

Shuai, Hong-Han, National Chiao Tung University

On Finding Socially Tenuous Groups for Online Social Networks (Page 415)


Shwartz, Larisa, IBM T.J. Watson Research Center

STAR: A System for Ticket Analysis and Resolution (Page 2181)


Si, Luo, Alibaba Group

Cascade Ranking for Operational E-commerce Search (Page 1557)


Si, Si, Google Inc. & Google Research

Communication-Efficient Distributed Block Minimization for Nonlinear Kernel Machines (Page 245)


Siffer, Alban, IRISA

Anomaly Detection in Streams with Extreme Value Theory (Page 1067)


Signorini, Alessio, Evidation Health, Inc.

Collecting and Analyzing Millions of mHealth Data Streams (Page 1971)


(Return to Top)

Silva, Daniel, Google Research

A Practical Algorithm for Solving the Incoherence Problem of Topic Models In Industrial Applications (Page 1713)


Silvestri, Fabrizio, Facebook

Interpretable Predictions of Tree-based Ensembles via Actionable Feature Tweaking (Page 465)


Simoudis, Evangelos, Synapse Partners

Foreword to the Applied Data Science - Invited Talks Track at KDD-2018 (Page 7)


Singh, Mayank, IIT Kharagpur

Relay-Linking Models for Prominence and Obsolescence in Evolving Networks (Page 1077)


Smith, Jamie, Google, Inc.

TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks (Page 1763)


Smith, Michael, Google Australia

Quick Access: Building a Smart Experience for Google Drive (Page 1643)


(Return to Top)

Soergel, David, Google, Inc.

TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks (Page 1763)


Solnik, Benjamin, Google Research

Google Vizier: A Service for Black-Box Optimization (Page 1487)


Song, Chaoming, University of Miami

A Temporally Heterogeneous Survival Framework with Application to Social Behavior Dynamics (Page 1295)


Song, Hwanjun, Korea Advanced Institute of Science and Technology

PAMAE: Parallel k-Medoids Clustering with High Accuracy and Efficiency (Page 1087)


Song, Le, Georgia Institute of Technology

GRAM: Graph-based Attention Model for Healthcare Representation Learning (Page 787)


Song, Qingquan, Texas A&M University

Multi-Aspect Streaming Tensor Completion (Page 435)


Song, Yangqiu, Hong Kong University of Science and Technology

Meta-Graph Based Recommendation Fusion over Heterogeneous Information Networks (Page 635)

HinDroid: An Intelligent Android Malware Detection System Based on Structured Heterogeneous Information Network (Page 1507)


(Return to Top)

Soni, Akshay, Yahoo Research

Online Ranking with Constraints: A Primal-Dual Algorithm and Applications to Web Traffic-Shaping (Page 405)


Soska, Kyle, Carnegie Mellon University

Automatic Application Identification from Billions of Files (Page 2021)


Spring, Ryan, Rice University

Scalable and Sustainable Deep Learning via Randomized Hashing (Page 445)


Srinivasan, Vidyuth, Entrupy Inc

The Fake vs Real Goods Problem: Microscopy and Machine Learning to the Rescue (Page 2011)


Srivastava, Ashok, Verizon

Foreword to the Applied Data Science - Invited Talks Track at KDD-2019 (Page 7)


St.Amand, Joseph, University of Kansas

Sparse Compositional Local Metric Learning (Page 1097)


(Return to Top)

Stein, Leon, eBay Inc.

Visual Search at eBay (Page 2101)


Stewart, Walter F., Sutter Health

GRAM: Graph-based Attention Model for Healthcare Representation Learning (Page 787)

LEAP: Learning to Prescribe Effective and Safe Treatment Combinations for Multimorbidity (Page 1315)


Su, Lu, SUNY Buffalo

Unsupervised Discovery of Drug Side-Effects from Heterogeneous Data Sources (Page 967)


Subramanian, Lakshminarayanan, Entrupy Inc and New York University

The Fake vs Real Goods Problem: Microscopy and Machine Learning to the Rescue (Page 2011)


Sugawara, Shinya, University of Tokyo

Decomposed Normalized Maximum Likelihood Codelength Criterion for Selecting Hierarchical Latent Variable Models (Page 1165)


Sugiura, Hiroki, The University of Tokyo

Multi-view Learning over Retinal Thickness and Visual Sensitivity on Glaucomatous Eyes (Page 2041)


Sun, Jimeng, Georgia Institute of Technology

Federated Tensor Factorization for Computational Phenotyping (Page 887)

GRAM: Graph-based Attention Model for Healthcare Representation Learning (Page 787)

LEAP: Learning to Prescribe Effective and Safe Treatment Combinations for Multimorbidity (Page 1315)

SPARTan: Scalable PARAFAC2 for Large & Sparse Data (Page 375)


(Return to Top)

Sun, Leilei, Tsinghua University

A Data-driven Process Recommender Framework (Page 2111)

Effective and Real-time In-App Activity Analysis in Encrypted Internet Traffic Streams (Page 335)

Functional Zone Based Hierarchical Demand Prediction For Bike System Expansion (Page 957)


Sun, Mengying, Michigan State University

Multi-Modality Disease Modeling via Collective Deep Matrix Factorization (Page 1155)


Sun, Tong, United Technologies Research Center & Xerox

Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks (Page 1903)


Sun, Yizhou, University of California, Los Angeles

On Sampling Strategies for Neural Network-based Collaborative Filtering (Page 767)

The Co-Evolution Model for Social Network Evolving and Opinion Migration (Page 175)


Swami, Ananthram, Army Research Laboratory

metapath2vec: Scalable Representation Learning for Heterogeneous Networks (Page 135)


Sybrandt, Justin, Clemson University

MOLIERE: Automatic Biomedical Hypothesis Generation System (Page 1633)


Szolovits, Peter, Massachusetts Institute of Technology

Predicting Clinical Outcomes Across Changing Electronic Health Record Systems (Page 1497)

 

(Return to Top to Navigate the KDD'17 Author Index)