[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2616498.2616543acmotherconferencesArticle/Chapter ViewAbstractPublication PagesxsedeConference Proceedingsconference-collections
research-article

Building an Information System for a Distributed Testbed

Published: 13 July 2014 Publication History

Abstract

This paper describes an information system designed to support the large volume of monitoring information generated by a distributed testbed. This monitoring information is produced by several subsystems and consists of status and performance data that needs to be federated, distributed, and stored in a timely and easy to use manner. Our approach differs from existing approaches because it federates and distributes information at a low architectural level via messaging; a natural match to many of the producers and consumers of information. In addition, a database is easily layered atop the messaging layer for consumers that want to query and search the information. Finally, a common language to represent information in all layers of the information system makes it significantly easier for users to consume information. Performance data shows that this approach meets the significant needs of FutureGrid and would meet the needs of other infrastructures.

References

[1]
S. Aiyagari et al. AMQP: Advanced Message Queuing Protocol Specification Version 0.9.1. Technical Report 0.9.1, AMQP Working Group, November 2008.
[2]
AMQP: Advanced Message Queuing Protocol. http://www.amqp.org.
[3]
S. Andreozzi, S. Burke, F. Ehm, L. Field, G. Galang, B. Konya, M. Litmaath, P. Millar, and J. Navarro. GLUE Specification v. 2.0. Technical Report GFD-R-P.147, The Open Grid Forum, March 2009.
[4]
Charlie Catlett. The Philosophy of TeraGrid: Building an Open, Extensible, Distributed TeraScale Facility. In Proceedings of the 2nd International Symposium on Cluster Computing and the Grid, 2002.
[5]
European Grid Infrastructure -- towards a sustainable infrastructure. http://www.egi.eu.
[6]
R. P. et al. The Open Science Grid. Journal of Physics: Conference Series, 78, 2007.
[7]
S. Fitzgerald, I. Foster, C. Kesselman, G. vol Laszewski, W. Smith, and S. Tuecke. A Directory Service for Configuring High-Performance Distributed Computations. In Sixth IEEE International Symposium on High Performance Distributed Computing, 1997.
[8]
G. Fox, G. von Laszewski, J. Diaz, K. Keahey, J. Fortes, R. Figueiredo, S. Smallen, W. Smith, and A. Grimshaw. Contemporary High Performance Computing: From Petascale toward Exascale, chapter FutureGrid - a reconfigurable testbed for Cloud, HPC and Grid Computing. Chapman and Hall, 2013.
[9]
T. R. Furlani, M. D. Jones, S. M. Gallo, A. E. Bruno, C.-D. Lu, A. Ghadersohi, R. J. Gentner, A. Patra, R. L. DeLeon, G. von Laszewski, F. Wang, and A. Zimmerman. Performance metrics and auditing framework using application kernels for high-performance computer systems. Concurrency and Computation: Practice and Experience, 25(7):918--931, 2013.
[10]
FutureGrid: An Experimental, High-Performance Grid Test-Bed. http://portal.futuregrid.org.
[11]
G. Garzoglio, T. Levshina, P. Mhashilkar, and S. Timm. ReSS: A Resource Selection Service for the Open Science Grid. Technical report, Fermilab, 2008.
[12]
Gratia. https://www.opensciencegrid.org/bin/view/Accounting/WebHome.
[13]
Gstat 2.0. http://gstat2.grid.sinica.edu.tw.
[14]
A. Hanemann et al. Perfsonar: A service oriented architecture for multi-domain network monitoring. In In Proceedings of the Third International Conference on Service Oriented Computing (ICSOC 2005). ACM Sigsoft and Sigweb, 2005.
[15]
M. Hanlon, W. Smith, and S. Mock. Providing Resource Information to Users of a National Computing Center. In Proceedings of the XSEDE13 conference, July 2013.
[16]
The Information Publishing Framework. https://bitbucket.org/wwsmith/ipf.
[17]
Introducing JSON. http://www.json.org.
[18]
L. Liming et al. Teragrid's integrated information service. In Proceedings of the 5th Grid Computing Environments Workshop, GCE '09, pages 8:1--8:10, New York, NY, USA, 2009. ACM.
[19]
M. Massie, B. Chun, and D. Culler. The Ganglia Distributed Monitoring System: Design, Implementation, and Experience. Parallel Computing, April 2004.
[20]
MyOSG One-Stop location for various OSG information. https://myosg.grid.iu.edu.
[21]
Nagios - The Industry Standard In IT Infrastructure Monitoring. http://www.nagios.org.
[22]
The Open Science Grid. http://www.opensciencegrid.org.
[23]
PostgreSQL: The world's most advanced open source database. http://www.postgres.org.
[24]
PRACE Research Infrastructure - The top level of the European HPC ecosystem. http://www.prace-project.eu.
[25]
First Annual Operations Report of the Tier-1 Service. http://www.prace-ri.eu/IMG/pdf/D6-1_2ip.pdf.
[26]
RabbitMQ: Messaging that just works. http://www.rabbitmq.com.
[27]
RabbitMQ Performance Measurements, part 2. http://www.rabbitmq.com/blog/2012/04/25/rabbitmq-performance-measurements-part-2/.
[28]
R. Raman, M. Livny, and M. Solomon. Matchmaking: Distributed resource management for high throughput computing. In Proceedings of the Seventh IEEE International Symposium on High Performance Distributed Computing (HPDC7), Chicago, IL, July 1998.
[29]
The Resource and Service Validation (RSV) Service. http//www.opensciencegrid.org/bin/view/Documentation/Release3/RsvOverview.
[30]
J. M. Schopf, L. Pearlman, N. Miller, C. Kesselman, and A. Chervenak. Monitoring the grid with the globus toolkit mds4. Journal of Physics: Conference Series, 46, 2006.
[31]
S. Smallen, K. Ericson, J. Hayes, and C. Olschanowsky. User-level grid monitoring with inca 2. In Proceedings of the 2007 workshop on Grid monitoring, GMW '07, pages 29--38, New York, NY, USA, 2007. ACM.
[32]
W. Smith. An Information Architecture Based on Publish/Subscribe Messaging. In Proceedings of the 2011 TeraGrid Conference, 2011.
[33]
SNAPP - SNMP Network Analysis and Presentation Package. http://snapp.sourceforge.net, 2007.
[34]
B. Tierney et al. The NetLogger Methodology for High Performance Distributed Systems Performance Analysis. In In Proc. 7th IEEE Symp. on High Performance Distributed Computing, pages 260--267, 1998.
[35]
XSEDE: eXtreme Science and Engineering Discovery Environment. http://www.xsede.org.

Cited By

View all
  • (2016)An Adaptive Road Traffic Regulation with Simulation and Internet of ThingsProceedings of the 2016 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation10.1145/2901378.2901406(3-11)Online publication date: 15-May-2016
  • (2015)Publishing and consuming GLUE v2.0 resource information in XSEDEProceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure10.1145/2792745.2792770(1-8)Online publication date: 26-Jul-2015

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
XSEDE '14: Proceedings of the 2014 Annual Conference on Extreme Science and Engineering Discovery Environment
July 2014
445 pages
ISBN:9781450328937
DOI:10.1145/2616498
  • General Chair:
  • Scott Lathrop,
  • Program Chair:
  • Jay Alameda
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

In-Cooperation

  • NSF: National Science Foundation
  • Drexel University
  • Indiana University: Indiana University

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 July 2014

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cyberinfrastructure
  2. information system
  3. messaging
  4. publish/subscribe
  5. testbed

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

XSEDE '14

Acceptance Rates

XSEDE '14 Paper Acceptance Rate 80 of 120 submissions, 67%;
Overall Acceptance Rate 129 of 190 submissions, 68%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2016)An Adaptive Road Traffic Regulation with Simulation and Internet of ThingsProceedings of the 2016 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation10.1145/2901378.2901406(3-11)Online publication date: 15-May-2016
  • (2015)Publishing and consuming GLUE v2.0 resource information in XSEDEProceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure10.1145/2792745.2792770(1-8)Online publication date: 26-Jul-2015

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media