DOI: 10.1145/1807167.1807315
demonstration

Midas: integrating public financial data

Published: 06 June 2010

Abstract

The primary goal of the Midas project is to build a system that enables easy and scalable integration of unstructured and semi-structured information present across multiple data sources. As a first step in this direction, we have built a system that extracts and integrates information from regulatory filings submitted to the U.S. Securities and Exchange Commission (SEC) and the Federal Deposit Insurance Corporation (FDIC). Midas creates a repository of entities, events, and relationships by extracting, conceptualizing, integrating, and aggregating data from unstructured and semi-structured documents. This repository enables applications to use the extracted and integrated data in a variety of ways including mashups with other public data and complex risk analysis.
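
The integration step described in the abstract can be pictured with a minimal Python sketch. This is not the Midas implementation; it only illustrates the general idea of grouping records extracted from different filings under a single entity. The record fields, the `normalize_name` helper, and every sample value are hypothetical and invented for illustration.

```python
from collections import defaultdict

# Hypothetical records, as if already extracted from different filings.
# All names and values are invented for illustration only.
EXTRACTED = [
    {"source": "SEC Form 8-K", "company": "Example Bank Corp.", "event": "acquisition"},
    {"source": "SEC Form 3", "company": "EXAMPLE BANK CORP", "person": "J. Doe", "role": "director"},
    {"source": "FDIC report", "company": "Other Trust Co.", "event": "capital raise"},
]


def normalize_name(name):
    """Crude entity key: lowercase, drop punctuation, collapse whitespace."""
    kept = "".join(ch for ch in name.lower() if ch.isalnum() or ch.isspace())
    return " ".join(kept.split())


def integrate(records):
    """Group records from different sources under one entity per company."""
    repo = defaultdict(lambda: {"sources": set(), "events": [], "people": []})
    for rec in records:
        entity = repo[normalize_name(rec["company"])]
        entity["sources"].add(rec["source"])
        if "event" in rec:
            entity["events"].append(rec["event"])
        if "person" in rec:
            entity["people"].append((rec["person"], rec["role"]))
    return dict(repo)


repository = integrate(EXTRACTED)
for key, entity in sorted(repository.items()):
    print(key, sorted(entity["sources"]), entity["events"], entity["people"])
```

Matching on a normalized name is the simplest possible linkage rule; a real system would need far more robust entity resolution than this.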

    Published In

    SIGMOD '10: Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
    June 2010
    1286 pages
    ISBN:9781450300322
    DOI:10.1145/1807167

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. data cleansing
    2. financial data
    3. information extraction
    4. information integration

    Qualifiers

    • Demonstration

    Conference

    SIGMOD/PODS '10: International Conference on Management of Data
    June 6 - 10, 2010
    Indianapolis, Indiana, USA

    Acceptance Rates

    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Cited By

    • (2017) Creation and interaction with large-scale domain-specific knowledge bases. Proceedings of the VLDB Endowment 10:12 (1965-1968). DOI: 10.14778/3137765.3137820. Online publication date: 1-Aug-2017
    • (2015) Multi-Agent Financial Network (MAFN) Model of US Collateralized Debt Obligations (CDO). Banking, Finance, and Accounting (561-590). DOI: 10.4018/978-1-4666-6268-1.ch030. Online publication date: 2015
    • (2015) Information Extraction of Regulatory Enforcement Actions. Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015 (950-953). DOI: 10.1145/2808797.2809368. Online publication date: 25-Aug-2015
    • (2015) A Content-Driven ETL Processes for Open Data. New Trends in Database and Information Systems II (29-40). DOI: 10.1007/978-3-319-10518-5_3. Online publication date: 2015
    • (2014) Characterizing Utilitarian Aggregation of Open Knowledge. Proceedings of the 1st IKDD Conference on Data Sciences (1-11). DOI: 10.1145/2567688.2567689. Online publication date: 21-Mar-2014
    • (2013) Multi-Agent Financial Network (MAFN) Model of US Collateralized Debt Obligations (CDO). Simulation in Computational Finance and Economics (225-254). DOI: 10.4018/978-1-4666-2011-7.ch012. Online publication date: 2013
    • (2013) A platform for eXtreme analytics. IBM Journal of Research and Development 57:3-4 (4-4). DOI: 10.1147/JRD.2013.2242693. Online publication date: 1-May-2013
    • (2013) A survey of Indian open data. Proceedings of the 5th IBM Collaborative Academia Research Exchange Workshop (1-4). DOI: 10.1145/2528228.2528230. Online publication date: 17-Oct-2013
    • (2013) SASH. Proceedings of the 2013 IEEE International Conference on Data Engineering (ICDE 2013) (1219-1230). DOI: 10.1109/ICDE.2013.6544911. Online publication date: 8-Apr-2013
    • (2013) Systemic risk analytics: A data-driven multi-agent financial network (MAFN) approach. Journal of Banking Regulation 14:3-4 (285-305). DOI: 10.1057/jbr.2013.10. Online publication date: 30-Aug-2013
