DOI: 10.5555/545215.545244
Article

Experiences with VI communication for database storage

Published: 01 May 2002

Abstract

This paper examines how VI-based interconnects can be used to improve I/O path performance between a database server and the storage subsystem. We design and implement a software layer, DSA, that sits between the application and VI. DSA takes advantage of specific VI features and compensates for many of its shortcomings. We provide and evaluate one kernel-level and two user-level implementations of DSA. These implementations trade transparency and generality for performance to different degrees and, unlike research prototypes, are designed to be suitable for real-world deployment. We present detailed measurements using a commercial database management system with both micro-benchmarks and industrial database workloads on a mid-size, 4-CPU database server and a large, 32-CPU database server. Our results show that VI-based interconnects and user-level communication can improve all aspects of the I/O path between the database system and the storage back-end. We also find that making effective use of VI in I/O-intensive environments requires substantially more functionality than VI currently provides. Finally, new storage APIs that help minimize kernel involvement in the I/O path are needed to fully exploit the benefits of VI-based communication.

Published In

ISCA '02: Proceedings of the 29th annual international symposium on Computer architecture
May 2002
346 pages
ISBN: 076951605X
  • Conference Chair: Yale Patt
  • Program Chair: Dirk Grunwald
  • Publications Chair: Kevin Skadron

ACM SIGARCH Computer Architecture News, Volume 30, Issue 2
Special Issue: Proceedings of the 29th annual international symposium on Computer architecture (ISCA '02)
May 2002
304 pages
ISSN: 0163-5964
DOI: 10.1145/545214

Publisher

IEEE Computer Society

United States

Author Tags

  1. Database storage
  2. Storage system
  3. User-level Communication
  4. Virtual Interface Architecture
  5. cluster-based storage
  6. processor overhead
  7. storage area network

Qualifiers

  • Article

Conference

ISCA02
Acceptance Rates

ISCA '02 Paper Acceptance Rate 27 of 180 submissions, 15%;
Overall Acceptance Rate 543 of 3,203 submissions, 17%
