Conference Program: SIGMOD Sessions
- Opening Remarks
- Keynotes
- Research Sessions
- Industry Sessions
- Demo Sessions
- Panel Discussions
- Posters
- Tutorials
- Student Research Competition
- New Researcher Symposium
- Sponsors
- DEI
- Award Talks
OPENING REMARKS
Tuesday June 20 8:30 am – 9:00 am
Location: Grand Ballroom ABCDEFG
Session Chairs: Sihem Amer-Yahia and K. Selcuk Candan
KEYNOTES
49 Years of Queries
Don Chamberlin, IBM Fellow (retired)
Tuesday June 20 9:00 am – 10:30 am
Location: Grand Ballroom ABCDEFG
Session Chair: Michael Carey
Mixed Methods Machine Learning
Vanessa Murdock, Amazon
Wednesday June 21 8:00 am – 9:30 am
Location: Grand Ballroom ABCDEFG
Session Chair: Lei Chen
DEI Perspectives in Information Technology Education
Shazia Sadiq, The University of Queensland
Wednesday June 21 9:30 am – 10:30 am
Location: Grand Ballroom ABCDEFG
Session Chair: Yuanyuan Tian
RESEARCH SESSIONS
Session 1: Time Series and data series
Tuesday June 20 11:00 am – 12:30 pm
Location: Evergreen A
Session Chair: Jarek Szlichta
- Time2State: An Unsupervised Framework for Inferring the Latent States in Time Series Data
- Grouping Time Series for Efficient Columnar Storage
- Time Series Data Validity
- Joint Neural Architecture and Hyperparameter Search for Correlated Time Series Forecasting
- Dumpy: A Compact and Adaptive Index for Large Data Series Collections
- ForestTI: A Scalable Inverted-Index-Oriented Timeseries Management System with Flexible Memory Efficiency
Session 2: Privacy, security and encryption, blockchains
Tuesday June 20 11:00 am – 12:30 pm
Location: Evergreen E
Session Chair: Ioannis Demertzis
- Toward Efficient Homomorphic Encryption for Outsourced Databases through Parallel Caching
- RLS Side Channels: Investigating Leakage of Row-Level Security Protected Data Through Query Execution Time
- A Framework for Privacy Preserving Localized Graph Pattern Query Processing
- Measuring Re-identification Risk
- Sequence-Based Target Coin Prediction for Cryptocurrency Pump-and-Dump
- When Private Blockchain Meets Deterministic Database
Session 3: Transactions & Indexing
Tuesday June 20 11:00 am – 12:30 pm
Location: Evergreen F
Session Chair: Vincent Oria
- Circinus: Fast Redundancy-Reduced Subgraph Matching
- I/O-Efficient Butterfly Counting at Scale
- Maximum k-Biplex Search on Bipartite Graphs: A Symmetric-BK Branching Approach
- Scaling Up k-Clique Densest Subgraph Detection
- Maximal Defective Clique Enumeration
- Efficient Biclique Counting in Large Bipartite Graphs
Session 4: Sampling and cardinality estimation
Tuesday June 20 11:00 am – 12:30 pm
Location: Evergreen GH
Session Chair: Graham Cormode
- Efficient Estimation of Pairwise Effective Resistance
- Speeding Up End-to-end Query Execution via Learning-based Progressive Cardinality Estimation
- FactorJoin: A New Cardinality Estimation Framework for Join Queries
- Efficient Sampling Approaches to Shapley Value Approximation
- SafeBound: A Practical System for Generating Cardinality Bounds
- LAQy: Efficient and Reusable Query Approximations via Lazy Sampling
Session 5: Time series and temporal data
Tuesday June 20 2:00 pm – 3:30 pm
Location: Evergreen A
Session Chair: Alfons Kemper
- LightCTS: A Lightweight Framework for Correlated Time Series Forecasting
- T-Rex: Optimizing Pattern Search on Time Series
- OM^3: An Ordered Multi-level Min-Max Representation for Interactive Progressive Visualization of Time Series
- LightTS: Lightweight Time Series Classification with Adaptive Ensemble Distillation
- DAMR: Dynamic Adjacency Matrix Representation Learning for Multivariate Time Series Imputation
- On Querying Spanned Connected Components in Large Temporal Graphs
Session 6: Differential privacy
Tuesday June 20 2:00 pm – 3:30 pm
Location: Evergreen I
Session Chair: Jia Zou
- A Neural Approach to Spatio-Temporal Data Release with User-Level Differential Privacy
- An Effective and Differentially Private Protocol for Secure Distributed Cardinality Estimation
- Practical Differentially Private and Byzantine-resilient Federated Learning
- PrivLava: Synthesizing Relational Data with Foreign Keys under Differential Privacy
- Global and Local Differentially Private Release of Count-Weighted Graphs
- Better than Composition: How to Answer Multiple Relational Queries under Differential Privacy
Session 7: Sampling, cardinality estimation, uncertainties and probabilities
Tuesday June 20 2:00 pm – 3:30 pm
Location: Evergreen F
Session Chair: Avigdor Gal
- Together is Better: Heavy Hitters Latency Quantile Estimation
- Efficient and Effective Cardinality Estimation for Skyline Family
- JoinSketch: A Sketch Algorithm for Accurate and Unbiased Inner-Product Estimation
- Most Expected Winner: An Interpretation of Winners over Uncertain Voter Preferences
- Probabilistic Reasoning at Scale: Trigger Graphs to the Rescue
- rkHit: Representative Query with Uncertain Preference
Session 8: Clustering
Tuesday June 20 4:00 pm – 5:30 pm
Location: Evergreen A
Session Chair: Julia Stoyanovich
- A New Sparse Data Clustering Method Based on Frequent Items
- An Efficient Algorithm for Distance-based Structural Graph Clustering
- Fast Density-Based Clustering: Geometric Approach
- FINEX: A Fast Index for Exact & Flexible Density-Based Clustering
- Efficient and Effective Attributed Hypergraph Clustering via K-Nearest Neighbor Augmentation
- Prerequisite-driven Fair Clustering on Heterogeneous Information Networks
Session 9: Joins
Tuesday June 20 4:00 pm – 5:30 pm
Location: Evergreen E
Session Chair: Ke Yi
- Raster Intervals: An Approximation Technique for Polygon Intersection Joins
- Detecting Logic Bugs of Join Optimizations in DBMS
- Efficiently Computing Join Orders with Heuristic Search
- Ready to Leap (by Co-Design)? Join Order Optimisation on Quantum Hardware
- Design and Analysis of a Processing-in-DIMM Join Algorithm: A Case Study with UPMEM DIMMs
- Free Join: Unifying Worst-Case Optimal and Traditional Joins
Session 10: Learning, embeddings and analytics on graphs
Tuesday June 20 4:00 pm – 5:30 pm
Location: Evergreen F
Session Chair: Sibo Wang
- CompressGraph: Efficient Parallel Graph Analytics with Rule-Based Compression
- Making It Tractable to Catch Duplicates and Conflicts in Graphs
- Grep: A Graph Learning Based Database Partitioning System
- Efficient Tree-SVD for Subset Node Embedding over Large Dynamic Graphs
- Graph Learning for Interaction Analysis in Smart Home Rule Data
- T-FSM: A Task-Based System for Massively Parallel Frequent Subgraph Pattern Mining from a Big Graph
Session 11: Data Models, Semantics, and Integration
Tuesday June 20 4:00 pm – 5:30 pm
Location: Evergreen GH
Session Chair: Mourad Ouzzani
- Learned Data-aware Image Representations of Line Charts for Similarity Search
- Discovering Similarity Inclusion Dependencies
- SANTOS: Relationship-based Semantic Table Union Search
- Composite Object Normal Forms
- Discovering Top-k Rules using Subjective and Objective Criteria
- Exploratory Training: When Annotators Learn About Data
Session 12: Transactions
Tuesday June 20 4:00 pm – 5:30 pm
Location: Evergreen I
Session Chair: Tianzheng Wang
- Transaction Scheduling: From Conflicts to Runtime Conflicts
- MRV: Enforcing Numeric Invariants in Parallel Updates to Hotspots with Randomized Splitting
- Polaris: Enabling Transaction Priority in Optimistic Concurrency Control
- DBPA: A Benchmark for Transactional Database Performance Anomalies
- Detock: High Performance Multi-region Transactions at Scale
- One-shot garbage collection for in-memory OLTP through temporality-aware version storage
Session 13: Random walks and reachability on graphs
Wednesday June 21 11:00 am – 12:30 pm
Location: Evergreen A
Session Chair: Mirek Riedewald
- Personalized PageRank on Evolving Graphs with an Incremental Index-Update Scheme
- Towards Generating Hop-constrained s-t Simple Path Graphs
- Effective and Efficient PageRank-based Positioning for Graph Visualization
- LightRW: FPGA Accelerated Graph Dynamic Random Walks
- Parallel Strong Connectivity Based on Faster Reachability
- HR-Index: An Effective Index Method for Historical Reachability Queries over Evolving Graphs
Session 14: Streams
Wednesday June 21 11:00 am – 12:30 pm
Location: Evergreen E
Session Chair: Silu Huang
- Fast Continuous Subgraph Matching over Streaming Graphs via Backtracking Reduction
- MorphStream: Adaptive Scheduling for Scalable Transactional Stream Processing on Multicores
- INEv: In-Network Evaluation for Event Stream Processing
- Pontus: Finding Waves in Data Streams
- Data Stream Clustering: An In-depth Empirical Study
- Ghost: A General Framework for High-Performance Online Similarity Queries over Distributed Trajectory Streams
Session 15: Spatial and temporal data
Wednesday June 21 11:00 am – 12:30 pm
Location: Evergreen F
Session Chairs: Amr Magdy and Ahmed Eldawy
- Effectiveness Perspectives and a Deep Relevance Model for Spatial Keyword Queries
- EAR-Oracle: On Efficient Indexing for Distance Queries between Arbitrary Points on Terrain Surface
- Spatio-Temporal Denoising Graph Autoencoders with Data Augmentation for Photovoltaic Data Imputation
- Caerus: A Caching-based Framework for Scalable Temporal Graph Neural Networks
- GeoGauss: Strongly Consistent Coordinator-Free OLTP for Geo-Replicated SQL Database
- The RLR-Tree: A Reinforcement Learning Based R-Tree for Spatial Data
Session 16: Query Optimization
Wednesday June 21 11:00 am – 12:30 pm
Location: Evergreen GH
Session Chair: Dan Suciu
- Exploiting Structure in Regular Expression Queries
- Computing the Difference of Conjunctive Queries Efficiently
- Selection Pushdown in Column Stores using Bit Manipulation Instructions
- Efficient Query Re-optimization with Judicious Subquery Selections
- Automating and Optimizing Data-Centric What-If Analyses on Native Machine Learning Pipelines
- Query-Guided Resolution of Uncertain Databases
Session 17: DB4ML
Wednesday June 21 2:00 pm – 3:30 pm
Location: Evergreen A
Session Chair: Fatma Ozcan
- Incremental Tabular Learning on Heterogeneous Feature Space
- FEAST: A Communication-efficient Federated Feature Selection Framework for Relational Data
- FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement
- ML2DAC: Meta-Learning to Democratize AutoML for Clustering Analysis
- GoodCore: Coreset Selection over Incomplete Data for Data-effective and Data-efficient Machine Learning
- DeltaBoost: Gradient Boosting Decision Trees with Efficient Machine Unlearning
Session 18: Subgraph matching and counting
Wednesday June 21 2:00 pm – 3:30 pm
Location: Evergreen E
Session Chair: Byron Choi
- Efficient Star-based Truss Maintenance on Dynamic Graphs
- Hereditary Cohesive Subgraphs Enumeration on Bipartite Graphs: The Power of Pivot-based Approaches
- GuP: Fast Subgraph Matching by Guard-based Pruning
- Efficient and Effective Algorithms for Generalized Densest Subgraph Discovery
- Efficient GPU-Accelerated Subgraph Matching
- Theories and Principles Matter: Towards Visually Appealing and Effective Abstraction of Property Graph Queries
Session 19: Coordination, distribution and clouds
Wednesday June 21 2:00 pm – 3:30 pm
Location: Evergreen F
Session Chair: Michael Mior
- Incentive-Aware Decentralized Data Collaboration
- A Unified and Efficient Coordinating Framework for Autonomous DBMS Tuning
- dsJSON: A Distributed SQL JSON Processor
- Generalizing Bulk-Synchronous Parallel Processing for Data Science: from data to threads and agent-based simulations
- DARQ Matter Binds Everything: Performant and Composable Cloud Programming via Resilient Steps
- Using Cloud Functions as Accelerator for Elastic Data Analytics
Session 20: Spatial and temporal data
Wednesday June 21 2:00 pm – 3:30 pm
Location: Evergreen GH
Session Chair: Cheng Long
- Matching Roles from Temporal Data
- ST4ML: Machine Learning Oriented Spatio-Temporal Data Processing at Scale
- Mining Geospatial Relationships from Text
- SSIN: Self-Supervised Learning for Rainfall Spatial Interpolation
- WISK: A Workload-aware Learned Index for Spatial Keyword Queries
- QHL: A Fast Algorithm for Exact Constrained Shortest Path Search on Road Networks
Session 21: ML4DB and Outlier detection
Thursday June 22 10:00 am – 11:30 am
Location: Evergreen A
Session Chair: Michael Gubanov
- Detect, Distill and Update: Learned DB Systems Facing Out of Distribution Data
- BALANCE: Bayesian Linear Attribution for Root Cause Localization
- Kepler: Robust Learning for Parametric Query Optimization
- XInsight: eXplainable Data Analysis Through The Lens of Causality
- AutoOD: Automatic Outlier Detection
- Robust and Transferable Log-based Anomaly Detection
Session 22: Knowledge graphs and data integration
Thursday June 22 10:00 am – 11:30 am
Location: Evergreen E
Session Chair: Fatemeh Nargesian
- A Universal Question-Answering Platform for Knowledge Graphs
- Deep Active Alignment of Knowledge Graph Entities and Schemata
- Maestro: Automatic Generation of Comprehensive Benchmarks for Question Answering Over Knowledge Graphs
- Ground Truth Inference for Weakly Supervised Entity Matching
- FlexER: Flexible Entity Resolution for Multiple Intents
- Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration
Session 23: Indexing and estimation
Thursday June 22 10:00 am – 11:30 am
Location: Evergreen F
Session Chair: Zhuoyue Zhao
- When Tree Meets Hash: Reducing Random Reads for Index Structures on Persistent Memories
- NeuroSketch: Fast and Approximate Evaluation of Range Aggregate Queries with Neural Networks
- Pea Hash: A Performant Extendible Adaptive Hashing Index
- Updatable Learned Indexes Meet Disk-Resident DBMS - From Evaluations to Design Choices
- InfiniFilter: Expanding Filters to Infinity and Beyond
- A Step Toward Deep Online Aggregation
Session 24: Big Data analytics and data science pipelines
Thursday June 22 2:00 pm – 3:30 pm
Location: Evergreen A
Session Chair: Brit Youngmann
- HybridPipe: Combining Human-generated and Machine-generated Pipelines for Data Preparation
- GIO: Generating Efficient Matrix and Frame Readers for Custom Data Formats by Example
- Predicate Pushdown for Data Science Pipelines
- DiffPrep: Differentiable Data Preprocessing Pipeline Search for Learning over Tabular Data
- Runtime Variation in Big Data Analytics
- QaaD (Query-as-a-Data): Scalable Execution of Massive Number of Small Queries in Spark
Session 25: Indexing and similarity search
Thursday June 22 2:00 pm – 3:30 pm
Location: Evergreen E
Session Chair: Dong Xie
- Efficient Approximate Nearest Neighbor Search in Multi-dimensional Databases
- High-Dimensional Approximate Nearest Neighbor Search: with Reliable and Efficient Distance Comparison Operations
- SplinterDB and Maplets: Improving the Tradeoffs in Key-Value Store Compaction Policy
- IcebergHT: High Performance PMEM Hash Tables Through Stability and Low Associativity
- Hamming Tree: The case for Energy-Aware Indexing for NVMs
- LiteHST: A Tree Embedding based Method for Similarity Search
Session 26: Graphs
Thursday June 22 2:00 pm – 3:30 pm
Location: Evergreen F
Session Chair: Senjuti Basu Roy
- TED: Towards Discovering Top-k Edge-Diversified Patterns in a Graph Database
- Shortest Paths Discovery in Uncertain Networks via Transfer Learning
- Efficient Personalized PageRank Computation: The Power of Variance-Reduced Monte Carlo Approaches
- Efficient Resistance Distance Computation: the Power of Landmark-based Approaches
- GraphINC: Graph Pattern Mining at Network Speed
- Scapin: Scalable Graph Structure Perturbation by Augmented Influence Maximization
Session 27: Modern hardware, performance, and benchmarking
Thursday June 22 2:00 pm – 3:30 pm
Location: Evergreen GH
Session Chair: Zhichao Cao
- Design Guidelines for Correct, Efficient, and Scalable Synchronization using One-Sided RDMA
- Distributed GPU Joins on Fast RDMA-capable Networks
- ClipSim: A GPU-friendly Parallel Framework for Single-Source SimRank with Accuracy Guarantee
- Virtual-Memory Assisted Buffer Management
- Optimizing Tensor Programs on Flexible Storage
- How To Optimize My Blockchain? A Multi-Level Recommendation Approach
Session 28: Data mining and discovery
Thursday June 22 4:00 pm – 5:30 pm
Location: Evergreen A
Session Chair: Felix Naumann
- Regularized Pairwise Relationship based Analytics for Structured Data
- Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning
- GitTables: A Large-Scale Corpus of Relational Tables
- Near-Duplicate Sequence Search at Scale for Neural Language Model Memorization Evaluation
- Unsupervised Hashing with Semantic Concept Mining
- FEC: Efficient Deep Recommendation Model Training with Flexible Embedding Communication
Session 29: Compression and fairness
Thursday June 22 4:00 pm – 5:30 pm
Location: Evergreen E
Session Chair: Abolfazl Asudeh
- Double-Anonymous Sketch: Achieving Fairness for Finding Global Top-K Frequent Items
- LadderFilter: Filtering Infrequent Items with Small Memory and Time Overhead
- TowerSensing: Linearly Compressing Sketches with Flexibility
- iFlipper: Label Flipping for Individual Fairness
- Hierarchical Residual Encoding for Multiresolution Compression
- BtrBlocks: Efficient Columnar Compression for Data Lakes
Session 30: Diffusion and Propagation in Graphs
Thursday June 22 4:00 pm – 5:30 pm
Location: Evergreen F
Session Chair: Behrooz Omidvar-Tehrani
- DUCATI: A Dual-Cache Training System for Graph Neural Networks on Giant Graphs with GPU
- Scalable and Efficient Full-Graph GNN Training for Large Graphs
- Managing Conflicting Interests of Stakeholders in Influencer Marketing
- EARLY: Efficient and Reliable Graph Neural Network for Dynamic Graphs
- Mitigating Filter Bubbles Under a Competitive Diffusion Model
- Popularity Ratio Maximization: Surpassing Competitors through Influence Propagation
Session 31: Optimizing data systems
Thursday June 22 4:00 pm – 5:30 pm
Location: Evergreen GH
Session Chair: Alekh Jindal
- LinCQA: Faster Consistent Query Answering with Linear Time Guarantees
- Data-Sharing Markets: Model, Protocol, and Algorithms to Incentivize the Formation of Data-Sharing Consortia
- Foreign Keys Open the Door for Faster Incremental View Maintenance
- Efficient and Portable Einstein Summation in SQL
- dbET: Execution Time Distribution-based Plan Selection
- AWARE: Workload-aware, Redundancy-exploiting Linear Algebra
INDUSTRY SESSIONS
Session 1
Tuesday June 20 2:00 pm – 3:30 pm
Location: Evergreen E
Session Chair: Yiwen Zhu
- Disaggregating RocksDB: A Production Experience
- GoldMiner: Elastic Scaling of Training Data Pre-Processing Pipelines for Deep Learning
- VeDB: A Software and Hardware Enabled Trusted Relational Database
- Presto: A Decade of SQL Analytics at Meta
- Keep Your Distributed Data Warehouse Consistent at a Minimal Cost
- What’s the difference? Incremental processing with change queries in Snowflake
- Apache IoTDB: A Time Series Database for IoT Applications
Industry Invited Papers
Wednesday June 21 2:00 pm – 3:30 pm
Location: Grand K
Session Chair: Hakan Hacigumus
- Auto-WLM: ML-enhanced workload management in Amazon Redshift
- DataChat: An Intuitive and Collaborative Data Analytics Platform
- Towards Building Autonomous Data Services on Azure
- Making Data Clouds Smarter at Keebo: Automated Warehouse Optimization using Data Learning
- Growing and Serving Large Open-domain Knowledge Graphs
Session 2
Thursday June 22 4:00 pm – 5:30 pm
Location: Evergreen B
Session Chair: Shaleen Deep
- PolarDB-IMCI: A Cloud-Native HTAP Database System at Alibaba
- Vineyard: Optimizing Data Sharing in Data-Intensive Analytics
- Steered Training Data Generation for Learned Semantic Type Detection
- When Automatic Filtering Comes to the Rescue: Pre-Computing Company Competitor Pairs in Owler
- High-Throughput Vector Similarity Search in Knowledge Graphs
- PG-Schemas: Schemas for Property Graphs
- GeaFlow: A Graph Extended and Accelerated Dataflow System
DEMO SESSIONS
Group A
Tuesday June 20 11:00 am – 12:30 pm
Location: Grand IJ
Thursday June 22 2:00 pm – 3:30 pm
Location: Grand IJ
- Proactively Screening Machine Learning Pipelines with ArgusEyes
- PyNKDV: An Efficient Network Kernel Density Visualization Library for Geospatial Analytic Systems
- ARENA: Alternative Relational Query Plan Exploration for Database Education
- Pay “Attention” to Chart Images for What You Read on Text
- Demonstrating MATE and COCOA for Data Discovery
- Demonstration of Geyser: Provenance Extraction and Applications over Data Science Scripts
- SMILE: A Cost-Effective System for Serving Massive Pretrained Language Models in the Cloud
- Demonstrating NaturalMiner: Searching Large Data Sets for Abstract Patterns Described in Natural Language
- Characterizing and Verifying Queries Via CInsGen
- CoWrangler: Recommender System for Data-Wrangling Scripts
- SHACTOR: Improving the Quality of Large-Scale Knowledge Graphs with Validating Shapes
- Fast Natural Language Based Data Exploration with Samples
- ATENA-PRO: Generating Personalized Exploration Notebooks with Constrained Reinforcement Learning
- Demonstration of ThalamusDB: Answering Complex SQL Queries with Natural Language Predicates on Multi-Modal Data
Group B
Tuesday June 20 2:00 pm – 3:30 pm
Location: Grand IJ
Thursday June 22 10:00 am – 11:30 am
Location: Grand IJ
- Dike: A Benchmark Suite for Distributed Transactional Databases
- BCNF* - From Normalized- to Star-Schemas and Back Again
- SparkSQL+: Next-generation Query Planning over Spark
- SCAD: A Scalability Advisor for Interactive Microservices on Hybrid Clouds
- Acheron: Persisting Tombstones in LSM Engines
- Dexer: Detecting and Explaining Biased Representation in Ranking
- TeeBench: Seamless Benchmarking in Trusted Execution Environments
- NEXUS: On Explaining Confounding Bias
- Aggregation and Exploration of High-Dimensional Data Using the Sudokube Data Cube Engine
- SmokedDuck Demonstration: SQLStepper
- DIALITE: Discover, Align and Integrate Open Data Tables
- A Demonstration of KAMEL: A Scalable BERT-based System for Trajectory Imputation
- A Demonstration of GeoTorchAI: A Spatiotemporal Deep Learning Framework
- Efficient Query Processing in Python Using Compilation
PANEL DISCUSSIONS
Future of Database System Architectures
Tuesday June 20 2:00 pm – 3:30 pm
Location: Evergreen E
Moderator: Raghu Ramakrishnan, Microsoft
Panelists:
- Gustavo Alonso, ETH
- Natassa Ailamaki, EPFL
- Sailesh Krishnamurthy, Google
- Sam Madden, MIT
- Swami Sivasubramanian, AWS
Personal Data for Personal Use: Vision or Reality?
Thursday June 22 10:00 am – 11:30 am
Location: Evergreen GH
Moderators: Alon Halevy and Wang-Chiew Tan, Meta
Panelists:
- Xin Luna Dong, Meta
- Bo Li, University of Illinois
- Julia Stoyanovich, NYU
- Anthony Kum Hoe Tung, National University of Singapore
- Gerhard Weikum, MPI
POSTER SESSIONS
Session 1
Wednesday June 21 4:00 pm – 5:30 pm
Location: Regency/Cedar Ballrooms and Regency Foyer
Session 2
Thursday June 22 11:30 am – 12:30 pm
Location: Regency/Cedar Ballrooms and Regency Foyer
TUTORIAL SESSIONS
Tutorial 1: Main memory database recovery strategies
Sunday June 18 9:00 am – 10:30 am
Location: Evergreen I
Tutorial 7: Demystifying Artificial Intelligence for Data Preparation
Sunday June 18 11:00 am – 12:30 pm
Location: Evergreen I
Tutorial 2: Quantum Machine Learning: Foundation, New techniques, and Opportunities for Database Research
Thursday June 22 2:00 pm – 3:30 pm
Thursday June 22 4:00 pm – 5:30 pm
Location: Grand K
Tutorial 3: An Overview of Reachability Indexes on Graphs
Thursday June 22 2:00 pm – 3:30 pm
Thursday June 22 4:00 pm – 5:30 pm
Location: Evergreen I
Tutorial 4: Fairness in Ranking: From Values to Technical Choices and Back
Thursday June 22 2:00 pm – 3:30 pm
Thursday June 22 4:00 pm – 5:30 pm
Location: Evergreen C
Tutorial 5: Optimizing Tensor Computations: From Applications to Compilation and Runtime Techniques
Friday June 23 9:00 am – 10:30 am
Location: Regency A
Tutorial 6: Large-scale Geospatial Analytics: Problems, Challenges, and Opportunities
Friday June 23 9:00 am – 10:30 am
Location: Cedar B
Tutorial 8: Table Discovery in Data Lakes: State-of-the-art and Future Directions
Sunday June 18 1:30 pm – 3:00 pm
Location: Evergreen I
Tutorial 9: Disaggregated Database Systems
Friday June 23 11:00 am – 12:30 pm
Location: Cedar B
Tutorial 10: Data Processing with FPGAs on Modern Architectures
Friday June 23 1:30 pm – 3:00 pm
Friday June 23 3:30 pm – 5:00 pm
Location: Regency A
Tutorial 11: Models and Practice of Neural Table Representations
Friday June 23 1:30 pm – 3:00 pm
Friday June 23 3:30 pm – 5:00 pm
Location: Cedar B
STUDENT RESEARCH COMPETITION
Session 1
Tuesday June 20 11:00 am – 12:30 pm
Location: Regency Ballroom
Session Chairs: Abolfazl Asudeh and Fatemeh Nargesian
Session 2
Wednesday June 21 4:00 pm – 5:30 pm
Location: Evergreen I
Session Chairs: Abolfazl Asudeh and Fatemeh Nargesian
NEW RESEARCHER SYMPOSIUM
Tuesday June 20 6:30 pm – 8:30 pm
Location: Evergreen A
Session Chairs: Jia Zou, Arizona State University and Parth Nagarkar, New Mexico State University
Panelists:
- Leilani Battle, University of Washington
- Chris Jermaine, Rice University
- Senjuti Basu Roy, New Jersey Institute of Technology
- Mo Sarwat, Wherobots
- Erkang Zhu (Eric Zhu), Microsoft Research
SPONSORS
Tuesday June 20 4:00 pm – 5:45 pm
Location: Grand K
Session Chair: Justin Levandoski
- Amazon: Innovation in AWS Database Services (Marc Brooker)
- Microsoft: Microsoft Fabric - Analytics in the AI Era (Raghu Ramakrishnan)
- Google: Data and AI at Google BigQuery Scale (Tomas Talius)
- Alibaba: Enhancing Database Systems with AI (Bolin Ding and Jingren Zhou)
- Confluent: Consensus in Apache Kafka: from Theory to Production (Jason Gustafson and Guozhang Wang)
- Salesforce: Enterprise and the Cloud: Why Is It Challenging? (Pat Helland)
- Databricks: The best warehouse is a Lakehouse (Ryan Johnson)
DEI
DEI pairings (newcomer, senior)
Monday June 19 Lunch
Chairs: Carlo Curino and Jesus Camacho Rodriguez
Birds-of-a-feather feedback/working hour (50 people)
Wednesday June 21 11:00 am – 12:30 pm
Location: Evergreen I
Chair: Avrilia Floratou
AWARD TALKS
Thursday June 22 8:00 am – 9:30 am
Location: Grand Ballroom ABCDEFG
Session Chair: Divy Agrawal