Shivnath Babu

D338 Levine Science Research Center
Department of Computer Science 
Duke University, Durham, NC 27708 
shivnath@cs.my_university.edu
Phone: 919-660-6579
Fax: 919-660-6519

    Conference Publications

  1. H. Lim, H. Herodotou, and S. Babu. Stubby: A Transformation-based Optimizer for MapReduce Workflows
    In Proc. of the 2012 Intl. Conference on Very Large Data Bases (VLDB), August 2012 (To appear)

  2. H. Herodotou, F. Dong, and S. Babu. No One (Cluster) Size Fits All: Automatic Cluster Sizing for Data-intensive Analytics
    In Proc. of the ACM Symposium on Cloud Computing 2011 (ACM SOCC 2011), October 2011 (Acceptance rate: 16.8%)

  3. H. Herodotou and S. Babu. Profiling, What-if Analysis, and Cost-based Optimization of MapReduce Programs
    In Proc. of the 2011 Intl. Conference on Very Large Data Bases (VLDB), August 2011 (Acceptance rate: unknown; less than 20% in previous years)

  4. N. Borisov, S. Babu, N. Mandagere, and S. Uttamchandani. Warding off the Dangers of Data Corruption with Amulet
    In Proc. of the 2011 ACM Intl. Conf. on Management of Data (SIGMOD), June 2011 (Acceptance rate: 23.2%)

  5. H. Herodotou, N. Borisov, and S. Babu. Query Optimization Techniques for Partitioned Tables
    In Proc. of the 2011 ACM Intl. Conf. on Management of Data (SIGMOD), June 2011 (Acceptance rate: 23.2%)

  6. M. Ahmad, S. Duan, A. Aboulnaga, and S. Babu. Predicting Completion Times of Batch Query Workloads using Interaction-aware Models and Simulation
    In Proc. of the Intl. Conference on Extending Database Technology (EDBT), March 2011 (Acceptance rate: 27.7%)

  7. H. Herodotou, H. Lim, G. Luo, N. Borisov, L. Dong, F. B. Cetin, and S. Babu. Starfish: A Self-tuning System for Big Data Analytics
    In Proc. of the Fifth Biennial Conference on Innovative Data Systems Research (CIDR), January 2011 (Acceptance rate: unknown; less than 30% in previous years)

  8. H. Herodotou and S. Babu. Xplus: A SQL-Tuning-Aware Query Optimizer
    In Proc. of PVLDB Volume 3 (the International Conference on Very Large Databases (VLDB)), September 2010 (Acceptance rate: unknown; less than 20% in previous years)

  9. S. Babu. Towards Automatic Optimization of MapReduce Programs
    In Proc. of the ACM Symposium on Cloud Computing 2010 (ACM SOCC 2010), June 2010 (Acceptance rate: 19%)

  10. H. Lim, S. Babu and J. Chase. Automated Control for Elastic Storage
    In Proc. of the Intl. Conference on Autonomic Computing (ICAC 2010), June 2010 (Acceptance rate: 26.5%)

  11. M. Ahmad, S. Duan, A. Aboulnaga, and S. Babu. Interaction-aware Prediction of Business Intelligence Workload Completion Times
    In Proc. of 2010 IEEE International Conference on Data Engineering (ICDE), April 2010 (short paper) (Acceptance rate: 21%)

  12. S. Duan, V. Thummala, and S. Babu. Tuning Database Configuration Parameters with iTuned
    In Proc. of the International Conference on Very Large Databases (VLDB), August 2009 (Acceptance rate: 16.7%)

  13. S. Duan, S. Babu, and K. Munagala. Fa: A System for Automating Failure Diagnosis
    In Proc. of 2009 IEEE International Conference on Data Engineering (ICDE), April 2009 (Acceptance rate: 17%)

  14. S. Babu, N. Borisov, S. Uttamchandani, R. Routray, and A. Singh. DIADS: Addressing the "My-Problem-or-Yours" Syndrome with Integrated SAN and Database Diagnosis
    In Proc. of the USENIX Conference on File and Storage Technologies (FAST), February 2009 (Acceptance rate: 22.5%)

  15. M. Ahmad, A. Aboulnaga, S. Babu, and K. Munagala. Modeling and Exploiting Query Interactions in Database Systems
    In Proc. of ACM International Conference on Information and Knowledge Management (CIKM), October 2008 (Acceptance rate: 17%)

  16. R. Thonangi, V. Thummala, and S. Babu. Finding Good Configurations in High-Dimensional Spaces: Doing More with Less
    In Proc. of the IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS), September 2008 (Acceptance rate: 38.3%)

  17. P. Shivam, V. Marupadi, J. Chase, and S. Babu. Cutting Corners: Workbench Automation for Server Benchmarking
    In Proc. of the 2008 USENIX Annual Technical Conference, June 2008 (Acceptance rate: 19.3%)

  18. S. Duan and S. Babu. Guided Problem Diagnosis through Active Learning
    In Proc. of the International Conference on Autonomic Computing (ICAC), June 2008 (Acceptance rate: 25%)

  19. S. Babu, S. Duan, and K. Munagala. Processing Diagnosis Queries: A Principled and Scalable Approach
    In Proc. of the 2008 International Conference on Data Engineering (ICDE), April 2008 (short paper)

  20. M. Ahmad, A. Aboulnaga, S. Babu, and K. Munagala. QShuffler: Getting the Query Mix Right
    In Proc. of the 2008 International Conference on Data Engineering (ICDE), April 2008 (short paper)

  21. S. Duan and S. Babu. Processing Forecasting Queries
    In Proc. of the International Conference on Very Large Databases (VLDB), September 2007 (Acceptance rate: 16.9%)

  22. B. Chandramouli, C. Bond, S. Babu, and J. Yang. Query Suspend and Resume
    In Proc. of the 2007 ACM Intl. Conf. on Management of Data (SIGMOD), June 2007 (Acceptance rate: 14.6%)

  23. B. Chandramouli, C. Bond, S. Babu, and J. Yang. On Suspending and Resuming Dataflows (short paper).
    In Proc. of IEEE International Conference on Data Engineering (ICDE), April 2007 (Acceptance rate: 27.6%)

  24. P. Shivam, S. Babu, and J. Chase. Active and Accelerated Learning of Cost Models for Optimizing Scientific Applications
    In Proc. of the International Conference on Very Large Databases (VLDB), September 2006 (Acceptance rate: 13.3%)

  25. P. Shivam, S. Babu, and J. Chase. Learning Application Models for Utility Resource Planning
    In Proc. of IEEE International Conference on Autonomic Computing (ICAC), June 2006 (Acceptance rate: 21%)

  26. P. Bizarro, S. Babu, D. DeWitt, and J. Widom. Content-Based Routing: Different Plans for Different Data
    In Proc. of the 2005 Conference on Very Large Databases (VLDB 2005), September 2005 (Acceptance rate: 16.4%)

  27. S. Babu, P. Bizarro, and D. DeWitt. Proactive Re-optimization
    In Proc. of the 2005 ACM Intl. Conf. on Management of Data (SIGMOD 2005), June 2005 (Acceptance rate: 15.1%)
    The Rio system described in this paper was demonstrated at SIGMOD 2005, June 2005

  28. S. Babu, K. Munagala, J. Widom, and R. Motwani. Adaptive Caching for Continuous Queries
    In Proc. of the Intl. Conf. on Data Engineering (ICDE 2005), April 2005 (Acceptance rate: 12.9%)

  29. S. Babu and P. Bizarro. Adaptive Query Processing in the Looking Glass
    In Proc. of the Second Biennial Conference on Innovative Data Systems Research (CIDR), January 2005

  30. K. Munagala, S. Babu, R. Motwani, and J. Widom. The Pipelined Set Cover Problem
    In Proc. of the Tenth Intl. Conf. on Database Theory (ICDT 2005), January 2005 (Acceptance rate: 28.6%)

  31. S. Babu, R. Motwani, K. Munagala, I. Nishizawa, and J. Widom. Adaptive Ordering of Pipelined Stream Filters
    In Proc. of the 2004 ACM Intl. Conf. on Management of Data (SIGMOD 2004), June 2004 (Acceptance rate: 16%)

  32. B. Babcock, S. Babu, M. Datar, and R. Motwani. Chain: Operator Scheduling for Memory Minimization in Data Stream Systems
    In Proc. of the 2003 ACM Intl. Conf. on Management of Data (SIGMOD 2003), June 2003 (Acceptance rate: 15.5%)

  33. R. Motwani, J. Widom, A. Arasu, B. Babcock, S. Babu, M. Datar, G. Manku, C. Olston, J. Rosenstein, and R. Varma. Query Processing, Approximation, and Resource Management in a Data Stream Management System
    In Proc. of the First Biennial Conference on Innovative Data Systems Research (CIDR 2003), Jan. 2003

  34. A. Arasu, B. Babcock, S. Babu, J. McAlister, and J. Widom. Characterizing Memory Requirements for Queries over Continuous Data Streams
    In Proc. of the 2002 ACM Symp. on Principles of Database Systems (PODS 2002), June 2002 (Acceptance rate: 22%)

  35. B. Babcock, S. Babu, M. Datar, R. Motwani, and J. Widom. Models and Issues in Data Stream Systems
    In Proc. of the 2002 ACM Symp. on Principles of Database Systems (PODS 2002), June 2002

  36. S. Babu, M. Garofalakis, and R. Rastogi. SPARTAN: A Model-Based Semantic Compression System for Massive Data Tables
    In Proc. of the 2001 ACM Intl. Conf. on Management of Data (SIGMOD 2001), May 2001 (Acceptance rate: 15%)
  37. Workshop Papers

  38. N. Borisov, S. Babu, N. Mandagere, and S. Uttamchandani. Dealing Proactively with Data Corruption: Challenges and Opportunities
    In Proc. of the Sixth Intl. Workshop on Self-Managing Database Systems (SMDB), April 2011

  39. H. Herodotou and S. Babu. Automated SQL Tuning through Trial and (Sometimes) Error
    In Proc. of the Second Workshop on Testing Database Systems (DBTest), June 2009

  40. M. Ahmad, A. Aboulnaga, and S. Babu. Query Interactions in Database Workloads
    In Proc. of the Second Workshop on Testing Database Systems (DBTest), June 2009

  41. A. Demberel, J. Chase, and S. Babu. Reflective Control for an Elastic Cloud Application: An Automated Experiment Workbench
    In Proc. of the First Workshop on Hot Topics in Cloud Computing (HotCloud), in conjunction with USENIX Annual Technical Conference, June 2009 (Acceptance rate: 32%)

  42. H. Lim, S. Babu, J. Chase, and S. Parekh. Automated Control in Cloud Computing: Challenges and Opportunities
    In Proc. of the First Workshop on Automated Control for Datacenters and Clouds, June 2009

  43. S. Babu, N. Borisov, S. Duan, H. Herodotou, and V. Thummala. Automated Experiment-Driven Management of (Database) Systems
    In Proc. of the 12th Workshop on Hot Topics in Operating Systems (HotOS), May 2009 (Acceptance rate: unknown; less than 20% in previous years)

  44. S. Duan and S. Babu. Empirical Comparison of Techniques for Automated Failure Diagnosis
    In Proc. of the Third Workshop on Tackling Computer Systems Problems with Machine Learning Techniques (SysML), December 2008

  45. S. Babu. Grand Challenge: Experiment-driven Adaptive Systems
    Vision paper written for invitation to the third Workshop on Hot Topics in Autonomic Computing (HotAC III), June 2008

  46. A. Yumerefendi, P. Shivam, D. Irwin, P. Gunda, L. Grit, A. Demberel, J. Chase, and S. Babu. Towards an Autonomic Computing Testbed
    In Workshop on Hot Topics in Autonomic Computing (HotAC), June 2007

  47. B. Cook, S. Babu, G. Candea, and S. Duan. Towards Self-Healing Multitier Services
    In Second Intl. Workshop on Self-Managing Database Systems (SMDB), April 2007

  48. P. Shivam, S. Babu, and J. Chase. Active Sampling for Accelerated Learning of Performance Models
    In Proc. of the First Workshop on Tackling Computer Systems Problems with Machine Learning Techniques (SysML), June 2006

  49. A. Arasu, S. Babu, and J. Widom. CQL: A Language for Continuous Queries over Streams and Relations
    In Proc. of the 9th International Workshop on Database Programming Languages, (DBPL 2003), September 2003

  50. U. Srivastava, S. Babu, and J. Widom. Monitoring Stream Properties for Continuous Query Processing
    In Proc. of the 2003 Workshop on Management and Processing of Data Streams (MPDS 2003), June 2003

  51. S. Babu, L. Subramanian, and J. Widom. A Data Stream Management System for Network Traffic Management
    In Proc. of the 2001 Workshop on Network-Related Data Management (NRDM 2001), May 2001

  52. S. Babu, M. Garofalakis, and R. Rastogi. Model-Based Semantic Compression for Network-Data Tables
    In Proc. of the 2001 Workshop on Network-Related Data Management (NRDM 2001), May 2001
  53. Journal Publications

  54. M. Ahmad, A. Aboulnaga, S. Babu, and K. Munagala. Interaction-aware Scheduling of Report Generation Workloads
    In the VLDB Journal, 2011

  55. A. Arasu, S. Babu, and J. Widom. The CQL Continuous Query Language: Semantic Foundations and Query Execution
    In the International Journal on Very Large Databases (VLDB Journal), 15:2 121-142, June 2006

  56. B. Babcock, S. Babu, M. Datar, R. Motwani, and D. Thomas. Operator Scheduling in Data Stream Systems
    In International Journal on Very Large Data Bases (VLDB Journal), December 2004

  57. S. Babu, U. Srivastava, and J. Widom. Exploiting k-Constraints to Reduce Memory Overhead in Continuous Queries over Data Streams
    In ACM Transactions on Database Systems (TODS), September 2004

  58. A. Arasu, B. Babcock, S. Babu, J. McAlister, and J. Widom. Characterizing Memory Requirements for Queries over Continuous Data Streams
    In ACM Transactions on Database Systems (TODS), March 2004

  59. S. Kowshik, S. Babu, G. Manimaran, C. Siva Ram Murthy. A Distributed Real-Time MAC Protocol for WDM-based LANs
    In Computer Communications, April 2001
  60. Demonstrations

  61. H. Herodotou, F. Dong, and S. Babu.
    MapReduce Programming and Cost-based Optimization? Crossing this Chasm with Starfish.
    Demonstrated at the 2011 International Conference on Very Large Data Bases (VLDB), August 2011 (Acceptance rate: 31.5%)

  62. N. Borisov and S. Babu. Proactive Detection and Repair of Data Corruption: Towards a Hassle-free Declarative Approach with Amulet
    Demonstrated at the 2011 International Conference on Very Large Data Bases (VLDB), August 2011 (Acceptance rate: 31.5%)

  63. V. Thummala and S. Babu. A Tool for Configuring and Visualizing Database Parameters
    Winner of the SIGMOD'10 Best-demo Award Competition!
    Demonstrated at the 2010 ACM Intl. Conf. on Management of Data (SIGMOD 2010), June 2010 (Acceptance rate: 36.8%)

  64. N. Borisov, S. Babu, S. Uttamchandani, R. Routray, and A. Singh. DIADS: A Problem Diagnosis Tool for Databases and Storage Area Networks
    Demonstrated at the 2009 International Conference on Very Large Databases (VLDB), August 2009 (Acceptance rate: 27.1%)

  65. S. Duan and S. Babu. Automated Diagnosis of System Failures with Fa
    Demonstrated at the 2009 IEEE International Conference on Data Engineering (ICDE), April 2009 (Acceptance rate: 28.3%)

  66. S. Duan, P. Franklin, V. Thummala, D. Zhao, and S. Babu. Shaman: A Self-Healing Database System
    Demonstrated at the 2009 IEEE International Conference on Data Engineering (ICDE), April 2009 (Acceptance rate: 28.3%)

  67. P. Shivam, A. Demberel, P. Gunda, D. Irwin, L. Grit, A. Yumerefendi, S. Babu, and J. Chase.
    Automated and On-Demand Provisioning of Virtual Machines for Database Applications
    Demonstrated at the 2007 ACM Intl. Conf. on Management of Data (SIGMOD 2007), June 2007

  68. S. Duan and S. Babu. Proactive Identification of Performance Problems
    Demonstrated at the 2006 ACM Intl. Conf. on Management of Data (SIGMOD 2006), June 2006

  69. S. Babu, P. Bizarro, and D. DeWitt. Proactive Re-optimization with Rio
    Demonstrated at the 2005 ACM Intl. Conf. on Management of Data (SIGMOD 2005), June 2005

  70. S. Babu and J. Widom. StreaMon: An Adaptive Engine for Stream Query Processing
    Demonstrated at the 2004 ACM Intl. Conf. on Management of Data (SIGMOD 2004), June 2004

  71. A. Arasu, B. Babcock, S. Babu, M. Datar, K. Ito, I. Nishizawa, J. Rosenstein, and J. Widom. STREAM: The Stanford Stream Data Manager
    Demonstrated at the 2003 ACM Intl. Conf. on Management of Data (SIGMOD 2003), June 2003
  72. Technical Reports Under Submission

  73. N. Borisov and S. Babu. Rapid Experimentation for Trustworthy Testing and Tuning in Production Database Deployments
    August 2011.

  74. R. Thonangi, S. Babu, and J. Yang. A Practical Concurrent Index for Solid-State Drives
    July 2011.

  75. H. Lim, H. Herodotou, S. Babu, and J. Chase. Auto-Optimized Execution of Data-Parallel Cluster Workflows
    March 2011.
  76. Technical Reports

    N. Borisov, S.Babu, S. Uttamchandani, R. Routray, and A. Singh. Why Did My Query Slow Down?
    Technical report at arXiv.org.
    Borisov, Uttamchandani, Routray, and Singh are the authors of a conference version of this technical report published in the Proceedings track of the 2009 Conference on Innovative Data Systems Research (CIDR), January 2009. Babu, while a contributor to this work, is not an author of the CIDR 2009 conference paper because of CIDR's "one-submission-per-author" rule.

    Articles

  77. S. Babu. Continuous Query
    In Encyclopedia of Database Systems, Springer, 2008.

  78. The STREAM Group. STREAM: The Stanford Data Stream Manager
    In IEEE Data Engineering Bulletin, Vol. 26, No. 1, March 2003

  79. S. Babu, M. Garofalakis, and R. Rastogi. SPARTAN: Using Constrained Models for Guaranteed-Error Semantic Compression
    In SIGKDD Explorations, Vol. 4, No. 1, June 2002

  80. S. Babu and J. Widom. Continuous Queries over Data Streams
    In SIGMOD Record, September 2001
  81. Tutorials

  82. S. Babu, S. Guha, and K. Munagala. Large-scale Uncertainty Management Systems: Learning and Exploiting your Data
    Given at the ACM Intl. Conf. on Management of Data (SIGMOD 2009), June 2009
  83. Book Chapters

  84. A. Arasu, B. Babcock, S. Babu, J. Cieslewicz, M. Datar, K. Ito, R. Motwani, U. Srivastava, and J. Widom. STREAM: The Stanford Data Stream Management System
    In Data-Stream Management: Processing High-Speed Data Streams, Springer-Verlag, New York. ISBN: 978-3-540-286073-3.
  85. Dissertation

  86. S. Babu. Adaptive Query Processing in Data Stream Management Systems
    Ph.D. thesis, Stanford University, 2005