Yanlei Diao
Person information
- affiliation: University of Massachusetts Amherst, USA
2020 – today
- 2024
- [j27] Chenghao Lyu, Qi Fan, Philippe Guyard, Yanlei Diao:
A Spark Optimizer for Adaptive, Fine-Grained Parameter Tuning. Proc. VLDB Endow. 17(11): 3565-3579 (2024)
- [j26] Luciano Di Palma, Yanlei Diao, Anna Liu:
Efficient Version Space Algorithms for Human-in-the-loop Model Development. ACM Trans. Knowl. Discov. Data 18(3): 69:1-69:49 (2024)
- [j25] Enhui Huang, Yanlei Diao, Anna Liu, Liping Peng, Luciano Di Palma:
Efficient and robust active learning methods for interactive database exploration. VLDB J. 33(4): 931-956 (2024)
- [c41] Yanlei Diao, Dominik Horn, Andreas Kipf, Oleksandr Shchur, Ines Benito, Wenjian Dong, Davide Pagano, Pascal Pfeil, Vikram Nathan, Balakrishnan Narayanaswamy, Tim Kraska:
Forecasting Algorithms for Intelligent Resource Scaling: An Experimental Analysis. SoCC 2024: 126-143
- [i10] Chenghao Lyu, Qi Fan, Philippe Guyard, Yanlei Diao:
A Spark Optimizer for Adaptive, Fine-Grained Parameter Tuning. CoRR abs/2403.00995 (2024)
- 2022
- [j24] Chenghao Lyu, Qi Fan, Fei Song, Arnab Sinha, Yanlei Diao, Wei Chen, Li Ma, Yihui Feng, Yaliang Li, Kai Zeng, Jingren Zhou:
Fine-Grained Modeling and Optimization for Intelligent Resource Management in Big Data Processing. Proc. VLDB Endow. 15(11): 3098-3111 (2022)
- [i9] Chenghao Lyu, Qi Fan, Fei Song, Arnab Sinha, Yanlei Diao, Wei Chen, Li Ma, Yihui Feng, Yaliang Li, Kai Zeng, Jingren Zhou:
Fine-Grained Modeling and Optimization for Intelligent Resource Management in Big Data Processing. CoRR abs/2207.02026 (2022)
- 2021
- [j23] Vincent Jacob, Fei Song, Arnaud Stiegler, Bijan Rad, Yanlei Diao, Nesime Tatbul:
Exathlon: A Benchmark for Explainable Anomaly Detection over Time Series. Proc. VLDB Endow. 14(11): 2613-2626 (2021)
- [j22] Vincent Jacob, Fei Song, Arnaud Stiegler, Bijan Rad, Yanlei Diao, Nesime Tatbul:
A Demonstration of the Exathlon Benchmarking Platform for Explainable Anomaly Detection. Proc. VLDB Endow. 14(12): 2827-2830 (2021)
- [c40] Bijan Rad, Fei Song, Vincent Jacob, Yanlei Diao:
Explainable anomaly detection on high-dimensional time series data. DEBS 2021: 2-14
- [c39] Fei Song, Khaled Zaouk, Chenghao Lyu, Arnab Sinha, Qi Fan, Yanlei Diao, Prashant J. Shenoy:
Spark-based Cloud Data Analytics using Multi-Objective Optimization. ICDE 2021: 396-407
- [c38] Yanlei Diao, Pawel Guzewicz, Ioana Manolescu, Mirjana Mazuran:
Efficient Exploration of Interesting Aggregates in RDF Graphs. SIGMOD Conference 2021: 392-404
- [i8] Khaled Zaouk, Fei Song, Chenghao Lyu, Yanlei Diao:
Neural-based Modeling for Performance Tuning of Spark Data Analytics. CoRR abs/2101.08167 (2021)
- [i7] Yanlei Diao, Pawel Guzewicz, Ioana Manolescu, Mirjana Mazuran:
Efficient Exploration of Interesting Aggregates in RDF Graphs. CoRR abs/2103.17178 (2021)
- 2020
- [i6] Fei Song, Khaled Zaouk, Chenghao Lyu, Arnab Sinha, Qi Fan, Yanlei Diao, Prashant J. Shenoy:
Boosting Cloud Data Analytics using Multi-Objective Optimization. CoRR abs/2005.03314 (2020)
- [i5] Vincent Jacob, Fei Song, Arnaud Stiegler, Yanlei Diao, Nesime Tatbul:
AnomalyBench: An Open Benchmark for Explainable Anomaly Detection. CoRR abs/2010.05073 (2020)
2010 – 2019
- 2019
- [j21] Yanlei Diao, Pawel Guzewicz, Ioana Manolescu, Mirjana Mazuran:
Spade: A Modular Framework for Analytical Exploration of RDF Graphs. Proc. VLDB Endow. 12(12): 1926-1929 (2019)
- [j20] Khaled Zaouk, Fei Song, Chenghao Lyu, Arnab Sinha, Yanlei Diao, Prashant J. Shenoy:
UDAO: A Next-Generation Unified Data Analytics Optimizer. Proc. VLDB Endow. 12(12): 1934-1937 (2019)
- [c37] Luciano Di Palma, Yanlei Diao, Anna Liu:
A Factorized Version Space Algorithm for "Human-In-the-Loop" Data Exploration. ICDM 2019: 1018-1023
- 2018
- [j19] Enhui Huang, Liping Peng, Luciano Di Palma, Ahmed Abdelkafi, Anna Liu, Yanlei Diao:
Optimization for Active Learning-based Interactive Database Exploration. Proc. VLDB Endow. 12(1): 71-84 (2018)
- [c36] Fei Song, Boyao Zhou, Quan Sun, Wang Sun, Shiwen Xia, Yanlei Diao:
Anomaly Detection and Explanation Discovery on Event Streams. BIRTE 2018: 5:1-5:5
- [c35] Fei Song, Yanlei Diao, Jesse Read, Arnaud Stiegler, Albert Bifet:
EXAD: A System for Explainable Anomaly Detection on Big Data Traces. ICDM Workshops 2018: 1435-1440
- [r4] Yanlei Diao, Michael J. Franklin:
Publish/Subscribe Over Streams. Encyclopedia of Database Systems (2nd ed.) 2018
- [r3] Yanlei Diao, Michael J. Franklin:
XML Publish/Subscribe. Encyclopedia of Database Systems (2nd ed.) 2018
- 2017
- [c34] Haopeng Zhang, Yanlei Diao, Alexandra Meliou:
EXstream: Explaining Anomalies in Event Stream Monitoring. EDBT 2017: 156-167
- [c33] Yanlei Diao, Ioana Manolescu, Shu Shang:
Dagger: Digging for Interesting Aggregates in RDF Graphs. ISWC (Posters, Demos & Industry Tracks) 2017
- [c32] Abhishek Roy, Yanlei Diao, Uday Evani, Avinash Abhyankar, Clinton Howarth, Rémi Le Priol, Toby Bloom:
Massively Parallel Processing of Whole Genome Sequence Data: An In-Depth Performance Study. SIGMOD Conference 2017: 187-202
- 2016
- [j18] Olga Papaemmanouil, Yanlei Diao, Kyriaki Dimitriadou, Liping Peng:
Interactive Data Exploration via Machine Learning Models. IEEE Data Eng. Bull. 39(4): 38-49 (2016)
- [j17] Kyriaki Dimitriadou, Olga Papaemmanouil, Yanlei Diao:
AIDE: An Active Learning-Based Approach for Interactive Data Exploration. IEEE Trans. Knowl. Data Eng. 28(11): 2842-2856 (2016)
- [p2] Yanlei Diao, Michael J. Franklin:
High-Performance XML Message Brokering. Data Stream Management 2016: 451-471
- [e2] Marcos K. Aguilera, Brian Cooper, Yanlei Diao:
Proceedings of the Seventh ACM Symposium on Cloud Computing, Santa Clara, CA, USA, October 5-7, 2016. ACM 2016, ISBN 978-1-4503-4525-5
- 2015
- [j16] Boduo Li, Yanlei Diao, Prashant J. Shenoy:
Supporting Scalable Analytics with Latency Constraints. Proc. VLDB Endow. 8(11): 1166-1177 (2015)
- [j15] Yanlei Diao, Kyriaki Dimitriadou, Zhan Li, Wenzhao Liu, Olga Papaemmanouil, Kemi Peng, Liping Peng:
AIDE: An Automatic User Navigation System for Interactive Data Exploration. Proc. VLDB Endow. 8(12): 1964-1967 (2015)
- [c31] Yanlei Diao, Abhishek Roy, Toby Bloom:
Building Highly-Optimized, Low-Latency Pipelines for Genomic Data Analysis. CIDR 2015
- [c30] Yanlei Diao:
Explore-By-Example: A New Database Service for Interactive Data Exploration. ExploreDB@SIGMOD/PODS 2015: 1
- [c29] Liping Peng, Yanlei Diao:
Supporting Data Uncertainty in Array Databases. SIGMOD Conference 2015: 545-560
- [i4] Kyriaki Dimitriadou, Olga Papaemmanouil, Yanlei Diao:
AIDE: An Automated Sample-based Approach for Interactive Data Exploration. CoRR abs/1510.08897 (2015)
- 2014
- [c28] Kyriaki Dimitriadou, Olga Papaemmanouil, Yanlei Diao:
Interactive data exploration based on user relevance feedback. ICDE Workshops 2014: 292-295
- [c27] Haopeng Zhang, Yanlei Diao, Neil Immerman:
On complexity and optimization of expensive queries in complex event processing. SIGMOD Conference 2014: 217-228
- [c26] Kyriaki Dimitriadou, Olga Papaemmanouil, Yanlei Diao:
Explore-by-example: an automatic query steering framework for interactive data exploration. SIGMOD Conference 2014: 517-528
- 2013
- [j14] Haopeng Zhang, Yanlei Diao, Neil Immerman:
Recognizing patterns in streams with imprecise timestamps. Inf. Syst. 38(8): 1187-1211 (2013)
- [j13] Thanh T. L. Tran, Yanlei Diao, Charles Sutton, Anna Liu:
Supporting User-Defined Functions on Uncertain Data. Proc. VLDB Endow. 6(6): 469-480 (2013)
- [j12] Yanlei Diao, Thomas Neumann:
Front Matter. Proc. VLDB Endow. 6(9): i-x (2013)
- [c25] Ugur Çetintemel, Mitch Cherniack, Justin A. DeBrabant, Yanlei Diao, Kyriaki Dimitriadou, Alexander Kalinin, Olga Papaemmanouil, Stanley B. Zdonik:
Query Steering for Interactive Data Exploration. CIDR 2013
- [c24] Yanlei Diao:
A Science Fiction Talk. CIDR 2013
- [c23] Kaituo Li, Christoph Reichenbach, Yannis Smaragdakis, Yanlei Diao, Christoph Csallner:
SEDGE: Symbolic example data generation for dataflow programs. ASE 2013: 235-245
- 2012
- [j11] Abhishek Roy, Yanlei Diao, Evan Mauceli, Yiping Shen, Bai-Lin Wu:
Massive Genomic Data Processing and Deep Analysis. Proc. VLDB Endow. 5(12): 1906-1909 (2012)
- [j10] Yanming Nie, Richard Cocci, Zhao Cao, Yanlei Diao, Prashant J. Shenoy:
SPIRE: Efficient Data Inference and Compression over RFID Streams. IEEE Trans. Knowl. Data Eng. 24(1): 141-155 (2012)
- [j9] Boduo Li, Edward Mazur, Yanlei Diao, Andrew McGregor, Prashant J. Shenoy:
SCALLA: A Platform for Scalable One-Pass Analytics Using MapReduce. ACM Trans. Database Syst. 37(4): 27:1-27:43 (2012)
- [j8] Thanh T. L. Tran, Liping Peng, Yanlei Diao, Andrew McGregor, Anna Liu:
CLARO: modeling and processing uncertain data streams. VLDB J. 21(5): 651-676 (2012)
- 2011
- [j7] Zhao Cao, Charles Sutton, Yanlei Diao, Prashant J. Shenoy:
Distributed inference and query processing for RFID tracking and monitoring. Proc. VLDB Endow. 4(5): 326-337 (2011)
- [j6] Liping Peng, Yanlei Diao, Anna Liu:
Optimizing Probabilistic Query Processing on Continuous Uncertain Data. Proc. VLDB Endow. 4(11): 1169-1180 (2011)
- [c22] Edward Mazur, Boduo Li, Yanlei Diao, Prashant J. Shenoy:
Towards Scalable One-Pass Analytics Using MapReduce. IPDPS Workshops 2011: 1102-1111
- [c21] Boduo Li, Edward Mazur, Yanlei Diao, Andrew McGregor, Prashant J. Shenoy:
A platform for scalable one-pass analytics using MapReduce. SIGMOD Conference 2011: 985-996
- [c20] Michael Bendersky, W. Bruce Croft, Yanlei Diao:
Quality-biased ranking of web documents. WSDM 2011: 95-104
- [i3] Zhao Cao, Charles Sutton, Yanlei Diao, Prashant J. Shenoy:
Distributed Inference and Query Processing for RFID Tracking and Monitoring. CoRR abs/1103.4410 (2011)
- 2010
- [j5] Haopeng Zhang, Yanlei Diao, Neil Immerman:
Recognizing Patterns in Streams with Imprecise Timestamps. Proc. VLDB Endow. 3(1): 244-255 (2010)
- [j4] Thanh T. L. Tran, Andrew McGregor, Yanlei Diao, Liping Peng, Anna Liu:
Conditioning and Aggregating Uncertain Data Streams: Going Beyond Expectations. Proc. VLDB Endow. 3(1): 1302-1313 (2010)
- [c19] Devesh Agrawal, Boduo Li, Zhao Cao, Deepak Ganesan, Yanlei Diao, Prashant J. Shenoy:
Exploiting the Interplay between Memory and Flash Storage in Embedded Sensor Devices. RTCSA 2010: 227-236
- [c18] Thanh T. L. Tran, Liping Peng, Boduo Li, Yanlei Diao, Anna Liu:
PODS: a new model and processing algorithms for uncertain data streams. SIGMOD Conference 2010: 159-170
- [p1] Fang Yu, Yanlei Diao, Randy H. Katz, T. V. Lakshman:
Fast Packet Pattern-Matching Algorithms. Algorithms for Next Generation Networks 2010: 219-238
2000 – 2009
- 2009
- [j3] Devesh Agrawal, Deepak Ganesan, Ramesh K. Sitaraman, Yanlei Diao, Shashi Singh:
Lazy-Adaptive Tree: An Optimized Index Structure for Flash Devices. Proc. VLDB Endow. 2(1): 361-372 (2009)
- [c17] Yanlei Diao, Boduo Li, Anna Liu, Liping Peng, Charles Sutton, Thanh T. L. Tran, Michael Zink:
Capturing Data Uncertainty in High-Volume Stream Processing. CIDR 2009
- [c16] Desislava Petkova, W. Bruce Croft, Yanlei Diao:
Refining Keyword Queries for XML Retrieval by Combining Content and Structure. ECIR 2009: 662-669
- [c15] Thanh T. L. Tran, Charles Sutton, Richard Cocci, Yanming Nie, Yanlei Diao, Prashant J. Shenoy:
Probabilistic Inference over RFID Streams in Mobile Environments. ICDE 2009: 1096-1107
- [r2] Yanlei Diao, Michael J. Franklin:
Publish/Subscribe over Streams. Encyclopedia of Database Systems 2009: 2211-2216
- [r1] Yanlei Diao, Michael J. Franklin:
XML Publish/Subscribe. Encyclopedia of Database Systems 2009: 3608-3613
- [i2] Yanlei Diao, Boduo Li, Anna Liu, Liping Peng, Charles Sutton, Thanh T. L. Tran, Michael Zink:
Capturing Data Uncertainty in High-Volume Stream Processing. CoRR abs/0909.1777 (2009)
- 2008
- [c14] Daniel Gyllstrom, Jagrati Agrawal, Yanlei Diao, Neil Immerman:
On Supporting Kleene Closure over Event Streams. ICDE 2008: 1391-1393
- [c13] Richard Cocci, Thanh T. L. Tran, Yanlei Diao, Prashant J. Shenoy:
Efficient Data Interpretation and Compression over RFID Streams. ICDE 2008: 1445-1447
- [c12] Jagrati Agrawal, Yanlei Diao, Daniel Gyllstrom, Neil Immerman:
Efficient pattern matching over event streams. SIGMOD Conference 2008: 147-160
- [e1] Yanlei Diao, Christian S. Jensen:
Proceedings of the 5th Workshop on Data Management for Sensor Networks, in conjunction with VLDB, DMSN 2008, Auckland, New Zealand, August 24, 2008. ACM International Conference Proceeding Series, ACM 2008, ISBN 978-1-60558-284-9
- 2007
- [c11] Yanlei Diao, Deepak Ganesan, Gaurav Mathur, Prashant J. Shenoy:
Rethinking Data Management for Storage-centric Sensor Networks. CIDR 2007: 22-31
- [c10] Daniel Gyllstrom, Eugene Wu, Hee-Jin Chae, Yanlei Diao, Patrick Stahlberg, Gordon Anderson:
SASE: Complex Event Processing over Streams (Demo). CIDR 2007: 407-411
- 2006
- [c9] Fang Yu, Zhifeng Chen, Yanlei Diao, T. V. Lakshman, Randy H. Katz:
Fast and memory-efficient regular expression matching for deep packet inspection. ANCS 2006: 93-102
- [c8] Eugene Wu, Yanlei Diao, Shariq Rizvi:
High-performance complex event processing over streams. SIGMOD Conference 2006: 407-418
- [i1] Daniel Gyllstrom, Eugene Wu, Hee-Jin Chae, Yanlei Diao, Patrick Stahlberg, Gordon Anderson:
SASE: Complex Event Processing over Streams. CoRR abs/cs/0612128 (2006)
- 2004
- [c7] Yanlei Diao, Shariq Rizvi, Michael J. Franklin:
Towards an Internet-Scale XML Dissemination Service. VLDB 2004: 612-623
- [c6] Yanlei Diao, Daniela Florescu, Donald Kossmann, Michael J. Carey, Michael J. Franklin:
Implementing Memoization in a Streaming XQuery Processor. XSym 2004: 35-50
- 2003
- [j2] Yanlei Diao, Michael J. Franklin:
High-Performance XML Filtering: An Overview of YFilter. IEEE Data Eng. Bull. 26(1): 41-48 (2003)
- [j1] Yanlei Diao, Mehmet Altinel, Michael J. Franklin, Hao Zhang, Peter M. Fischer:
Path sharing and predicate evaluation for high-performance XML filtering. ACM Trans. Database Syst. 28(4): 467-516 (2003)
- [c5] Yanlei Diao, Michael J. Franklin:
Query Processing for High-Volume XML Message Brokering. VLDB 2003: 261-272
- 2002
- [c4] Yanlei Diao, Peter M. Fischer, Michael J. Franklin, Raymond To:
YFilter: Efficient and Scalable Filtering of XML Documents. ICDE 2002: 341-342
- 2000
- [c3] Yanlei Diao, Hongjun Lu, Dekai Wu:
A Comparative Study of Classification Based Personal E-mail Filtering. PAKDD 2000: 408-419
- [c2] Songting Chen, Yanlei Diao, Hongjun Lu, Zengping Tian:
Fact: A Learning Based Web Query Processing System. SIGMOD Conference 2000: 587
- [c1] Yanlei Diao, Hongjun Lu, Songting Chen, Zengping Tian:
Toward Learning Based Web Query Processing. VLDB 2000: 317-328
last updated on 2024-12-02 21:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license