19th KDD 2013: Chicago, IL, USA
- Inderjit S. Dhillon, Yehuda Koren, Rayid Ghani, Ted E. Senator, Paul Bradley, Rajesh Parekh, Jingrui He, Robert L. Grossman, Ramasamy Uthurusamy:
The 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2013, Chicago, IL, USA, August 11-14, 2013. ACM 2013, ISBN 978-1-4503-2174-7
Keynotes
- Raghu Ramakrishnan:
Scale-out beyond map-reduce. 1 - Andrew Y. Ng, Daphne Koller:
The online revolution: education for everyone. 2 - Stephen J. Wright:
Optimization in learning and data analysis. 3 - Hal R. Varian:
Predicting the present with search engine data. 4
Document and topic models
- Jian Tang, Ming Zhang, Qiaozhu Mei:
One theme in all views: modeling consensus topics in multiple contexts. 5-13 - Khalid El-Arini, Min Xu, Emily B. Fox, Carlos Guestrin:
Representing documents through their readers. 14-22 - Kevin Bache, David Newman, Padhraic Smyth:
Text-based measures of document diversity. 23-31 - Zeinab Abbassi, Vahab S. Mirrokni, Mayur Thakur:
Diversity maximization under matroid constraints. 32-40
Social media
- Reza Zafarani, Huan Liu:
Connecting users across social media sites: a behavioral-modeling approach. 41-49 - Tadej Stajner, Bart Thomee, Ana-Maria Popescu, Marco Pennacchiotti, Alejandro Jaimes:
Automatic selection of social media responses to news. 50-58 - Jaewon Yang, Bee-Chung Chen, Deepak Agarwal:
Estimating sharer reputation via social data calibration. 59-67 - Wei Shen, Jianyong Wang, Ping Luo, Min Wang:
Linking named entities in Tweets with knowledge base via user interest modeling. 68-76
Big data frameworks
- Wook-Shin Han, Sangyeon Lee, Kyungyeol Park, Jeong-Hoon Lee, Min-Soo Kim, Jinha Kim, Hwanjo Yu:
TurboGraph: a fast parallel graph engine handling billion-scale graphs in a single PC. 77-85 - Karthik Raman, Adith Swaminathan, Johannes Gehrke, Thorsten Joachims:
Beyond myopic inference in big data pipelines. 86-94 - John F. Canny, Huasha Zhao:
Big data analytics with small footprint: squaring the cloud. 95-103
Graph mining
- Charalampos E. Tsourakakis, Francesco Bonchi, Aristides Gionis, Francesco Gullo, Maria A. Tsiarli:
Denser than the densest subgraph: extracting optimal quasi-cliques with quality guarantees. 104-112 - Sean Gilpin, Tina Eliassi-Rad, Ian N. Davidson:
Guided learning for role discovery (GLRD): framework, algorithms, and applications. 113-121 - Jia Wang, James Cheng, Ada Wai-Chee Fu:
Redundancy-aware maximal cliques. 122-130 - Quanquan Gu, Charu C. Aggarwal, Jialu Liu, Jiawei Han:
Selective sampling on graphs for classification. 131-139
Classification
- Wenlin Chen, Yixin Chen, Yi Mao, Baolong Guo:
Density-based logistic regression. 140-148 - Dan Zhang, Jingrui He, Richard D. Lawrence:
MI2LS: multi-instance learning from multiple information sources. 149-157 - Zheng Wang, Jieping Ye:
Querying discriminative and representative samples for batch mode active learning. 158-166 - Harikrishna Narasimhan, Shivani Agarwal:
SVMpAUCtight: a new support vector method for optimizing partial AUC based on a tight convex upper bound. 167-175
Healthcare and bioinformatics
- Yasuo Tabei, Akihiro Kishimoto, Masaaki Kotera, Yoshihiro Yamanishi:
Succinct interval-splitting tree for scalable similarity search of compound-protein pairs with property constraints. 176-184 - Shuo Xiang, Lei Yuan, Wei Fan, Yalin Wang, Paul M. Thompson, Jieping Ye:
Multi-source learning with block-wise missing data for Alzheimer's disease prediction. 185-193 - Ian N. Davidson, Sean Gilpin, Owen T. Carmichael, Peter B. Walker:
Network discovery via constrained tensor analysis of fMRI data. 194-202
Recommender systems
- Mahashweta Das, Gianmarco De Francisci Morales, Aristides Gionis, Ingmar Weber:
Learning to question: leveraging user preferences for shopping advice. 203-211 - Danica J. Sutherland, Barnabás Póczos, Jeff G. Schneider:
Active learning and search on low-rank matrices. 212-220 - Hongzhi Yin, Yizhou Sun, Bin Cui, Zhiting Hu, Ling Chen:
LCARS: a location-content-aware recommender system. 221-229
Scalable methods for big data
- Mingdong Ou, Peng Cui, Fei Wang, Jun Wang, Wenwu Zhu, Shiqiang Yang:
Comparing apples to oranges: a scalable solution with heterogeneous hashing. 230-238 - Ninh Pham, Rasmus Pagh:
Fast and scalable polynomial kernels via explicit feature maps. 239-247 - Ian En-Hsu Yen, Chun-Fu Chang, Ting-Wei Lin, Shan-Wei Lin, Shou-De Lin:
Indexed block coordinate descent for large-scale linear classification with limited memory. 248-256 - Siddharth Gopal, Yiming Yang:
Recursive regularization for large-scale classification with hierarchical and graphical dependencies. 257-265
Temporal/social influence
- Tomoharu Iwata, Amar Shah, Zoubin Ghahramani:
Discovering latent influence in online social activities via shared cascade poisson processes. 266-274 - Konstantin Kutzkov, Albert Bifet, Francesco Bonchi, Aristides Gionis:
STRIP: stream learning of influence probabilities. 275-283 - Mohammad Taha Bahadori, Yan Liu, Eric P. Xing:
Fast structure learning in generalized stochastic processes with latent factors. 284-292
Sparse learning
- Aurélie C. Lozano, Huijing Jiang, Xinwei Deng:
Robust sparse estimation of multiresponse regression and inverse covariance matrix via the L2 distance. 293-301 - Ping Li, Cun-Hui Zhang:
Exact sparse recovery with L0 projections. 302-310 - Qian Sun, Shuo Xiang, Jieping Ye:
Robust principal component analysis via capped norms. 311-319
Graph clustering
- Wei Cheng, Xiang Zhang, Zhishan Guo, Yubao Wu, Patrick F. Sullivan, Wei Wang:
Flexible and robust co-regularized multi-domain graph clustering. 320-328 - Johan Ugander, Brian Karrer, Lars Backstrom, Jon M. Kleinberg:
Graph cluster randomization: network exposure to multiple universes. 329-337 - Yang Zhou, Ling Liu:
Social influence based clustering of heterogeneous information networks. 338-346
Diffusion in social networks
- Jie Tang, Sen Wu, Jimeng Sun:
Confluence: conformity influence in large social networks. 347-355 - Lilian Weng, Jacob Ratkiewicz, Nicola Perra, Bruno Gonçalves, Carlos Castillo, Francesco Bonchi, Rossano Schifanella, Filippo Menczer, Alessandro Flammini:
The role of information diffusion in the evolution of social networks. 356-364 - Shuyang Lin, Fengjiao Wang, Qingbo Hu, Philip S. Yu:
Extracting social events for learning better information diffusion models. 365-373
Time series and spatial data
- Assaf Hallak, Dotan Di Castro, Shie Mannor:
Model selection in markovian processes. 374-382 - Yanping Chen, Bing Hu, Eamonn J. Keogh, Gustavo E. A. P. A. Batista:
DTW-D: time series semi-supervised learning from a single example. 383-391 - Huanhuan Chen, Fengzhen Tang, Peter Tiño, Xin Yao:
Model-based kernel for efficient time series analysis. 392-400
Diffusion in social networks
- Milad Eftekhar, Yashar Ganjali, Nick Koudas:
Information cascade at group scale. 401-409
Time series and spatial data
- Lu-An Tang, Xiao Yu, Quanquan Gu, Jiawei Han, Alice Leung, Thomas La Porta:
Mining lines in the sand: on trajectory discovery from untrustworthy data in cyber-physical system. 410-418
Unsupervised and topic learning
- Ariel Kleiner, Ameet Talwalkar, Sameer Agarwal, Ion Stoica, Michael I. Jordan:
A general bootstrap performance diagnostic. 419-427 - Arthur Zimek, Matthew Gaudet, Ricardo J. G. B. Campello, Jörg Sander:
Subsampling for efficient and effective unsupervised outlier detection ensembles. 428-436 - Chi Wang, Marina Danilevsky, Nihit Desai, Yinan Zhang, Phuong Nguyen, Thrivikrama Taula, Jiawei Han:
A phrase mining framework for recursive construction of a topical hierarchy. 437-445 - James R. Foulds, Levi Boyles, Christopher DuBois, Padhraic Smyth, Max Welling:
Stochastic collapsed variational Bayesian inference for latent Dirichlet allocation. 446-454
Social and information networks
- Caleb Chen Cao, Yongxin Tong, Lei Chen, H. V. Jagadish:
WiseMarket: a new paradigm for managing wisdom of online social users. 455-463 - Xi Wang, Gita Sukthankar:
Multi-label relational neighbor classification using social context features. 464-472 - Yaojia Zhu, Xiaoran Yan, Lise Getoor, Cristopher Moore:
Scalable text and link analysis with mixed-topic link models. 473-481 - Yangqiu Song, Zhengdong Lu, Cane Wing-ki Leung, Qiang Yang:
Collaborative boosting for activity classification in microblogs. 482-490
Graph mining and sampling
- Bruno D. Abrahao, Flavio Chierichetti, Robert Kleinberg, Alessandro Panconesi:
Trace complexity of network inference. 491-499 - Abhimanyu Das, Sreenivas Gollapudi, Rina Panigrahy, Mahyar Salek:
Debiasing social wisdom. 500-508 - Sayan Ranu, Minh X. Hoang, Ambuj K. Singh:
Mining discriminative subgraphs from global-state networks. 509-517 - Pranay Anchuri, Mohammed J. Zaki, Omer Barkol, Shahar Golan, Moshe Shamy:
Approximate graph mining with label costs. 518-526
Rule and pattern mining
- Chunyang Liu, Ling Chen, Chengqi Zhang:
Summarizing probabilistic frequent patterns: a fast approach. 527-535 - Cheng-Wei Wu, Yu-Feng Lin, Philip S. Yu, Vincent S. Tseng:
Mining high utility episodes in complex event sequences. 536-544 - Entong Shen, Ting Yu:
Mining frequent graph patterns with differential privacy. 545-553
Web mining
- Yukino Baba, Hisashi Kashima:
Statistical quality estimation for general crowdsourcing tasks. 554-562 - Taifeng Wang, Jiang Bian, Shusen Liu, Yuyu Zhang, Tie-Yan Liu:
Psychological advertising: exploring user psychology for click prediction in sponsored search. 563-571 - Simon Lacoste-Julien, Konstantina Palla, Alex Davies, Gjergji Kasneci, Thore Graepel, Zoubin Ghahramani:
SIGMa: simple greedy matching for aligning large knowledge bases. 572-580
Best paper session
- Edo Liberty:
Simple and deterministic matrix sketching. 581-588 - Madhav Jha, C. Seshadhri, Ali Pinar:
A space efficient streaming algorithm for triangle counting using the birthday paradox. 589-597
Research poster session
- Quan Yuan, Gao Cong, Zongyang Ma, Aixin Sun, Nadia Magnenat-Thalmann:
Who, where, when and what: discover spatio-temporal topics for twitter users. 605-613 - Xiangnan Kong, Bokai Cao, Philip S. Yu:
Multi-label classification by mining label and instance correlations from heterogeneous information networks. 614-622 - Yin Lou, Rich Caruana, Johannes Gehrke, Giles Hooker:
Accurate intelligible models with pairwise interactions. 623-631 - Arjun Mukherjee, Abhinav Kumar, Bing Liu, Junhui Wang, Meichun Hsu, Malú Castellanos, Riddhiman Ghosh:
Spotting opinion spammers using behavioral footprints. 632-640 - Sen Yang, Jie Wang, Wei Fan, Xiatian Zhang, Peter Wonka, Jieping Ye:
An efficient ADMM algorithm for multidimensional anisotropic total variation regularization problems. 641-649 - Deepayan Chakrabarti, Ralf Herbrich:
Speeding up large-scale learning with a social prior. 650-658 - Santosh Kabbur, Xia Ning, George Karypis:
FISM: factored item similarity models for top-N recommender systems. 659-667 - Shouichi Nagano, Yusuke Ichikawa, Noriko Takaya, Tadasu Uchiyama, Makoto Abe:
Nonparametric hierarchal bayesian modeling in non-contractual heterogeneous survival data. 668-676 - Kaixiang Mo, Erheng Zhong, Qiang Yang:
Cross-task crowdsourcing. 677-685 - Manas Joglekar, Hector Garcia-Molina, Aditya G. Parameswaran:
Evaluating the crowd with confidence. 686-694 - Yuchen Zhao, Guan Wang, Philip S. Yu, Shaobo Liu, Simon Zhang:
Inferring social roles and statuses in social networks. 695-703 - Siyuan Liu, Yisong Yue, Ramayya Krishnan:
Adaptive collective routing using gaussian process dynamic congestion models. 704-712 - De-Nian Yang, Hui-Ju Hung, Wang-Chien Lee, Wei Chen:
Maximizing acceptance probability for active friending in online social networks. 713-721 - Xiting Wang, Shixia Liu, Yangqiu Song, Baining Guo:
Mining evolutionary multi-branch trees from text streams. 722-730 - Xuezhi Wang, Roman Garnett, Jeff G. Schneider:
Active search on graphs. 731-738 - Da Kuang, Haesun Park:
Fast rank-2 nonnegative matrix factorization for hierarchical document clustering. 739-747 - Jingbo Zhou, Anthony K. H. Tung, Wei Wu, Wee Siong Ng:
A "semi-lazy" approach to probabilistic path prediction. 748-756 - Lu Zheng, Ole J. Mengshoel:
Optimizing parallel belief propagation in junction trees using regression. 757-765 - Liang Ge, Jing Gao, Xiaoyi Li, Aidong Zhang:
Multi-source deep learning for information trustworthiness estimation. 766-774 - Tsung-Ting Kuo, Rui Yan, Yu-Yang Huang, Perng-Hwa Kung, Shou-De Lin:
Unsupervised link prediction using aggregative statistics on heterogeneous social networks. 775-783 - Conrad Lee, Bobo Nick, Ulrik Brandes, Pádraig Cunningham:
Link prediction with social vector clocks. 784-792 - Dmytro Karamshuk, Anastasios Noulas, Salvatore Scellato, Vincenzo Nicosia, Cecilia Mascolo:
Geo-spotting: mining online location-based services for optimal retail store placement. 793-801 - Guoliang Li, Yang Wang, Ting Wang, Jianhua Feng:
Location-aware publish/subscribe. 802-810 - Jiangwen Sun, Jinbo Bi, Henry R. Kranzler:
Quadratic optimization to identify highly heritable quantitative traits from complex phenotypic features. 811-819 - Dóra Erdös, Vatche Ishakian, Azer Bestavros, Evimaria Terzi:
Repetition-aware content placement in navigational networks. 820-828 - Ye Wang, Ahmed Metwally, Srinivasan Parthasarathy:
Scalable all-pairs similarity search in metric spaces. 829-837 - Muzaffer Can Altinigneli, Claudia Plant, Christian Böhm:
Massively parallel expectation maximization using graphics processing units. 838-846 - Chris Thornton, Frank Hutter, Holger H. Hoos, Kevin Leyton-Brown:
Auto-WEKA: combined selection and hyperparameter optimization of classification algorithms. 847-855 - Ming Tan, Tian Xia, Lily Guo, Shaojun Wang:
Direct optimization of ranking measures for learning to rank models. 856-864 - Shuo Chen, Jiexun Xu, Thorsten Joachims:
Multi-space probabilistic sequence modeling. 865-873 - Yuan Hao, Yanping Chen, Jesin Zakaria, Bing Hu, Thanawin Rakthanmanon, Eamonn J. Keogh:
Towards never-ending learning from time series streams. 874-882 - Yang Mu, Wei Ding, Tianyi Zhou, Dacheng Tao:
Constrained stochastic gradient descent for large-scale least squares problem. 883-891 - Wei Chen, Wynne Hsu, Mong-Li Lee:
Making recommendations from multiple domains. 892-900 - Peng Cui, Shifei Jin, Linyun Yu, Fei Wang, Wenwu Zhu, Shiqiang Yang:
Cascading outbreak prediction in networks: a data-driven approach. 901-909 - Wei Zhang, Jianyong Wang, Wei Feng:
Combining latent factor model with location features for event-based group recommendation. 910-918 - Peilin Zhao, Steven C. H. Hoi:
Cost-sensitive online active learning with application to malicious URL detection. 919-927 - Wei Lu, Francesco Bonchi, Amit Goyal, Laks V. S. Lakshmanan:
The bang for the buck: fair competitive viral marketing from the host perspective. 928-936 - Erheng Zhong, Wei Fan, Yin Zhu, Qiang Yang:
Modeling the dynamics of composite social networks. 937-945 - Goce Ristanoski, Wei Liu, James Bailey:
A time-dependent enhanced support vector machine for time series regression. 946-954 - Katja Niemann, Martin Wolpers:
A new collaborative filtering approach for increasing the aggregate diversity of recommender systems. 955-963 - Jun Zhu, Xun Zheng, Li Zhou, Bo Zhang:
Scalable inference in max-margin topic models. 964-972 - Gartheeban Ganeshapillai, John V. Guttag:
A data-driven method for in-game decision making in MLB: when to pull a starting pitcher. 973-979 - Xiao Bai, Flavio Paiva Junqueira, Srinivasan H. Sengamedu:
Exploiting user clicks for automatic seed set generation for entity matching. 980-988 - Peifeng Yin, Ping Luo, Wang-Chien Lee, Min Wang:
Silence is also evidence: interpreting dwell time for recommendation from psychological perspective. 989-997 - Andy Diwen Zhu, Xiaokui Xiao, Sibo Wang, Wenqing Lin:
Efficient single-source shortest path and distance queries on large graphs. 998-1006 - Marek Ciglan, Michal Laclavik, Kjetil Nørvåg:
On community detection in real-world networks and the importance of degree assortativity. 1007-1015 - Xiaohui Bei, Ning Chen, Liyu Dou, Xiangru Huang, Ruixin Qiang:
Trial and error in influential social networks. 1016-1024 - Xiaodong Zheng, Hao Ding, Hiroshi Mamitsuka, Shanfeng Zhu:
Collaborative matrix factorization with multiple similarities for predicting drug-target interactions. 1025-1033 - Jiayu Zhou, Zhaosong Lu, Jimeng Sun, Lei Yuan, Fei Wang, Jieping Ye:
FeaFiner: biomarker identification from medical data through feature generalization and selection. 1034-1042 - Bin Liu, Yanjie Fu, Zijun Yao, Hui Xiong:
Learning geographical preferences for point-of-interest recommendation. 1043-1051 - Sebastián Moreno, Jennifer Neville, Sergey Kirshner:
Learning mixed kronecker product graph models with simulated method of moments. 1052-1060 - Komal Kapoor, Nisheeth Srivastava, Jaideep Srivastava, Paul R. Schrater:
Measuring spontaneous devaluations in user preferences. 1061-1069 - Yang Li, Chi Wang, Fangqiu Han, Jiawei Han, Dan Roth, Xifeng Yan:
Mining evidences for named entity disambiguation. 1070-1078 - Aaron Johnson, Vitaly Shmatikov:
Privacy-preserving data exploration in genome-wide association studies. 1079-1087 - Huan Sun, Alex Morales, Xifeng Yan:
Synthetic review spamming and defense. 1088-1096 - Dafna Shahaf, Jaewon Yang, Caroline Suen, Jeff Jacobs, Heidi Wang, Jure Leskovec:
Information cartography: creating zoomable, large-scale maps of information. 1097-1105 - Joel Nishimura, Johan Ugander:
Restreaming graph partitioning: simple versatile algorithms for advanced balancing. 1106-1114 - Xiaolong Wang, Chengxiang Zhai, Dan Roth:
Understanding evolution of research themes: a probabilistic generative model for citations. 1115-1123 - Xiao Cai, Chris H. Q. Ding, Feiping Nie, Heng Huang:
On the equivalent of low-rank linear regressions and linear discriminant analysis based regressions. 1124-1132
Industry practice expo invited presentations
- Oren Etzioni:
To buy or not to buy: that is the question. 1133 - Eric E. Schadt:
Mining the digital universe of data to develop personalized cancer therapies. 1134 - Jeremy Howard:
The business impact of deep learning. 1135 - Ari Gesher:
Adaptive adversaries: building systems to fight fraud and cyber intruders. 1136 - Rayid Ghani:
Targeting and influencing at scale: from presidential elections to social good. 1137 - Milind Bhandarkar:
Hadoop: a view from the trenches. 1138 - Raffael Marty:
Cyber security: how visual analytics unlock insight. 1139 - Chris Neumann:
Using "big data" to solve "small data" problems. 1140
Industrial and government deployed
- Kareem S. Aggour, Bethany Hoogs:
Financing lead triggers: empowering sales reps through knowledge discovery and fusion. 1141-1149 - Ye Chen, Weiguo Liu, Jeonghee Yi, Anton Schwaighofer, Tak W. Yan:
Query clustering based on bid landscape for sponsored search auction optimization. 1150-1158 - Einat Kermany, Hanna Mazzawi, Dorit Baras, Yehuda Naveh, Hagai Michaelis:
Analysis of advanced meter infrastructure data of water consumption in apartment buildings. 1159-1167 - Ron Kohavi, Alex Deng, Brian Frasca, Toby Walker, Ya Xu, Nils Pohlmann:
Online controlled experiments at large scale. 1168-1176 - Wenxing Hong, Lei Li, Tao Li, Wenfu Pan:
iHR: an online recruiting system for Xiamen Talent Service Center. 1177-1185 - Nima Asadi, Jimmy Lin, Michael Busch:
Dynamic memory allocation policies for postings in real-time Twitter search. 1186-1194 - Luo Jie, Sudarshan Lamkhede, Rochit Sapra, Evans Hsu, Helen Song, Yi Chang:
A unified search federation system based on online user feedback. 1195-1203 - Prem Melville, Vijil Chenthamarakshan, Richard D. Lawrence, James Powell, Moses Mugisha, Sharad Sapra, Rajesh Anandan, Solomon Assefa:
Amplifying the voice of youth in Africa via text analytics. 1204-1212 - Troy Raeder, Claudia Perlich, Brian Dalessandro, Ori Stitelman, Foster J. Provost:
Scalable supervised dimensionality reduction using clustering. 1213-1221 - H. Brendan McMahan, Gary Holt, David Sculley, Michael Young, Dietmar Ebner, Julian Grady, Lan Nie, Todd Phillips, Eugene Davydov, Daniel Golovin, Sharat Chikkerur, Dan Liu, Martin Wattenberg, Arnar Mar Hrafnkelsson, Tom Boulos, Jeremy Kubica:
Ad click prediction: a view from the trenches. 1222-1230 - Xuan Song, Quanshi Zhang, Yoshihide Sekimoto, Teerayut Horanont, Satoshi Ueyama, Ryosuke Shibasaki:
Modeling and probabilistic reasoning of population evacuation during large-scale disaster. 1231-1239 - Ori Stitelman, Claudia Perlich, Brian Dalessandro, Rod Hook, Troy Raeder, Foster J. Provost:
Using co-visitation networks for detecting large scale online display advertising exchange fraud. 1240-1248 - Liang Tang, Tao Li, Larisa Shwartz, Florian Pinel, Genady Grabarnik:
An integrated framework for optimizing automatic monitoring systems in large IT infrastructures. 1249-1257 - Sholom M. Weiss, Amit Dhurandhar, Robert J. Baseman:
Improving quality control by early prediction of manufacturing outcomes. 1258-1266
Industrial and government discovery
- Daniel Emerson, Justin Weligamage, Richi Nayak:
A data mining driven risk profiling method for road asset management. 1267-1275 - Bin Fu, Jialiu Lin, Lei Li, Christos Faloutsos, Jason I. Hong, Norman M. Sadeh:
Why people hate your app: making sense of user feedback in a mobile app store. 1276-1284 - Dawei Wang, Wei Ding, Kui Yu, Xindong Wu, Ping Chen, David L. Small, Shafiqul Islam:
Towards long-lead forecasting of extreme flood events: a data mining framework for precipitation cluster precursors identification. 1285-1293 - Jeonghee Yi, Ye Chen, Jie Li, Swaraj Sett, Tak W. Yan:
Predictive model performance: offline and online evaluations. 1294-1302
Industrial and government emerging
- Eytan Bakshy, Dean Eckles:
Uncertainty in online experiments with dependent data: an evaluation of bootstrap methods. 1303-1311 - Varun Chandola, Sreenivas R. Sukumar, Jack C. Schryver:
Knowledge discovery from massive healthcare claims data. 1312-1320 - Anurag Bhardwaj, Atish Das Sarma, Wei Di, Raffay Hamid, Robinson Piramuthu, Neel Sundaresan:
Palette power: enabling visual search through colors. 1321-1329 - Hongliang Fei, Younghun Kim, Sambit Sahu, Milind R. Naphade, Sanjay K. Mamidipalli, John Hutchinson:
Heat pump detection from coarse grained smart meter data with positive and unlabeled learning. 1330-1338 - Rave Harpaz, William DuMouchel, Paea LePendu, Nigam H. Shah:
Empirical bayes model to combine signals of adverse drug reactions. 1339-1347 - Christiane Kamdem Kengne, Léon Constantin Fopa, Alexandre Termier, Noha Ibrahim, Marie-Christine Rousset, Takashi Washio, Miguel Santana:
Efficiently rewriting large multimedia application execution traces with few event sequences. 1348-1356 - Deguang Kong, Guanhua Yan:
Discriminant malware distance learning on structural information for automated malware classification. 1357-1365 - Patrick Lucey, Dean Oliver, Peter Carr, Joe Roth, Iain A. Matthews:
Assessing team strategy using spatiotemporal data. 1366-1374 - Arun S. Maiya, John P. Thompson, Francisco Loaiza-Lemos, Robert M. Rolfe:
Exploratory analysis of highly heterogeneous document collections. 1375-1383 - Thomas A. Montgomery, Paul M. Stieg, Michael J. Cavaretta, Paul E. Moraal:
Experience from hosting a corporate prediction market: benefits beyond the forecasts. 1384-1392 - Ted E. Senator, Henry G. Goldberg, Alex Memory, William T. Young, Brad Rees, Robert Pierce, Daniel Huang, Matthew Reardon, David A. Bader, Edmond Chow, Irfan A. Essa, Joshua Jones, Vinay Bettadapura, Duen Horng Chau, Oded Green, Oguz Kaya, Anita Zakrzewska, Erica Briscoe, Rudolph L. Mappus IV, Robert McColl, Lora Weiss, Thomas G. Dietterich, Alan Fern, Weng-Keen Wong, Shubhomoy Das, Andrew Emmott, Jed Irvine, Jay Yoon Lee, Danai Koutra, Christos Faloutsos, Daniel D. Corkill, Lisa Friedland, Amanda Gentzel, David D. Jensen:
Detecting insider threats in a real corporate database of computer usage activity. 1393-1401 - Paulo Shakarian, Patrick Roos, Devon Callahan, Cory Kirk:
Mining for geographically disperse communities in social networks by leveraging distance modularity. 1402-1409 - Truyen Tran, Dinh Q. Phung, Wei Luo, Richard Harvey, Michael Berk, Svetha Venkatesh:
An integrated framework for suicide risk prediction. 1410-1418 - Ranga Raju Vatsavai:
Gaussian multiple instance learning approach for mapping the slums of the world using very high resolution imagery. 1419-1426 - Huayu Wu, Wee Siong Ng, Kian-Lee Tan, Wei Wu, Shili Xiang, Mingqiang Xue:
A privacy preserving framework for managing vehicle data in road pricing systems. 1427-1435 - Yu Zheng, Furui Liu, Hsun-Ping Hsieh:
U-Air: when urban air quality inference meets big data. 1436-1444
Panel
- Foster J. Provost, Geoffrey I. Webb:
Panel: a data scientist's guide to making money from start-ups. 1445
Demonstrations
- Mohamed Reda Bouadjenek, Hakim Hacid, Mokrane Bouzeghoub:
LAICOS: an open source platform for personalized social web search. 1446-1449 - Yu Cheng, Yusheng Xie, Zhengzhang Chen, Ankit Agrawal, Alok N. Choudhary, Songtao Guo:
JobMiner: a real-time system for mining job-related patterns from social media. 1450-1453 - Meng-Fen Chiang, Yung-Hsiang Lin, Wen-Chih Peng, Philip S. Yu:
Inferring distant-time location in low-sampling-rate trajectories. 1454-1457 - Marina Danilevsky, Chi Wang, Fangbo Tao, Son Nguyen, Gong Chen, Nihit Desai, Lidan Wang, Jiawei Han:
AMETHYST: a system for mining and exploring topical hierarchies of heterogeneous data. 1458-1461 - Pritam Gundecha, Suhas Ranganath, Zhuo Feng, Huan Liu:
A tool for collecting provenance data in social media. 1462-1465 - Ting Hua, Feng Chen, Liang Zhao, Chang-Tien Lu, Naren Ramakrishnan:
STED: semi-supervised targeted-interest event detection in twitter. 1466-1469 - Fang Jin, Nathan Self, Parang Saraf, Patrick Butler, Wei Wang, Naren Ramakrishnan:
Forex-foreteller: currency trend modeling using news articles. 1470-1473 - Kathy Lee, Ankit Agrawal, Alok N. Choudhary:
Real-time disease surveillance using Twitter data: demonstration on flu and cancer. 1474-1477 - Pei Lee, Laks V. S. Lakshmanan, Evangelos E. Milios:
KeySee: supporting keyword search on evolving events in social streams. 1478-1481 - Fred Morstatter, Shamanth Kumar, Huan Liu, Ross Maciejewski:
Understanding Twitter data with TweetXplorer. 1482-1485 - Mika Rautiainen, Jouni Sarvanko, Arto Heikkinen, Mika Ylianttila, Vassilis Kostakos:
An online system with end-user services: mining novelty concepts from tv broadcast subtitles. 1486-1489 - Céline Robardet, Vasile-Marian Scuturici, Marc Plantevit, Antoine Fraboulet:
When TEDDY meets GrizzLY: temporal dependency discovery for triggering road deicing operations. 1490-1493 - Fangbo Tao, Kin Hou Lei, Jiawei Han, Chengxiang Zhai, Xiao Cheng, Marina Danilevsky, Nihit Desai, Bolin Ding, Jing Ge, Heng Ji, Rucha Kanade, Anne Kao, Qi Li, Yanen Li, Cindy Xide Lin, Jialu Liu, Nikunj C. Oza, Ashok N. Srivastava, Rodney Tjoelker, Chi Wang, Duo Zhang, Bo Zhao:
EventCube: multi-dimensional search and mining of structured and text data. 1494-1497 - Yaqiong Wang, Hongfu Liu, Hao Lin, Junjie Wu, Zhiang Wu, Jie Cao:
SEA: a system for event analysis on chinese tweets. 1498-1501 - Yang Yang, Jianfei Wang, Yutao Zhang, Wei Chen, Jing Zhang, Honglei Zhuang, Zhilin Yang, Bo Ma, Zhanpeng Fang, Sen Wu, Xiaoxiao Li, Debing Liu, Jie Tang:
SAE: social analytic engine for large networks. 1502-1505 - Chunqiu Zeng, Yexi Jiang, Li Zheng, Jingxuan Li, Lei Li, Hongtai Li, Chao Shen, Wubai Zhou, Tao Li, Bing Duan, Ming Lei, Pengnian Wang:
FIU-Miner: a fast, integrated, and user-friendly system for data mining in distributed environment. 1506-1509 - Jun Zhang, Chaokun Wang, Yuanchi Ning, Yichi Liu, Jianmin Wang, Philip S. Yu:
LAFT-Explorer: inferring, visualizing and predicting how your social network expands. 1510-1513 - Zhou Zhao, Da Yan, Wilfred Ng, Shi Gao:
A transfer learning based framework of crowd-selection on twitter. 1514-1517 - Kiyana Zolfaghar, Jayshree Agarwal, Deepthi Sistla, Si-Chi Chin, Senjuti Basu Roy, Nele Verbiest:
Risk-O-Meter: an intelligent clinical risk calculator. 1518-1521
Tutorials
- Alan M. Frieze, Aristides Gionis, Charalampos E. Tsourakakis:
Algorithmic techniques for modeling and mining large graphs (AMAzING). 1523 - Spiros Papadimitriou, Tina Eliassi-Rad:
Mining data from mobile devices: a survey of smart sensing and analytics. 1524 - Jimeng Sun, Chandan K. Reddy:
Big data analytics for healthcare. 1525 - Lise Getoor, Ashwin Machanavajjhala:
Entity resolution for big data. 1527 - Lise Getoor, Ashwin Machanavajjhala:
Network sampling. 1528 - Amr Ahmed, Alexander J. Smola:
The dataminer's guide to scalable mixed-membership and nonparametric bayesian models. 1529