Extracting, Linking and Integrating Data from Public Sources : A Financial Case Study
We present Midas, a system that uses complex data processing to extract and aggregate facts from a large collection of structured and unstructured documents into a set of unified, clean entities and relationships. Midas focuses on data for financial companies and is based on periodic filings with the U.S. Securities and Exchange Commission (SEC) and Federal Deposit Insurance Corporation (FDIC). We show that, by using data aggregated by Midas, we can provide valuable insights about financial institutions either at the whole system level or at the individual company level. The key technology components that we implemented in Midas and that enable the various financial applications are: information extraction, entity resolution, mapping and fusion, all on top of a scalable infrastructure based on Hadoop. We describe our experience in building the Midas system and also outline the key research questions that remain to be addressed towards building a generic, high-level infrastructure for large-scale data integration from public sources.
Year of publication: | 2015 |
---|---|
Authors: | Burdick, Douglas |
Other Persons: | Hernandez, Mauricio (contributor) ; Ho, Howard (contributor) ; Koutrika, Georgia (contributor) ; Krishnamurthy, Rajasekar (contributor) ; Popa, Lucian Constantin (contributor) ; Stanoi, Ioana (contributor) ; Vaithyanathan, Shivakumar (contributor) ; Das, Sanjiv Ranjan (contributor) |
Publisher: | [2015]: [S.l.] : SSRN |
Extent: | 1 online resource (8 p.) |
---|---|
Type of publication: | Book / Working Paper |
Language: | English |
Notes: | According to SSRN, the original version of the document was created on September 28, 2015 |
Other identifiers: | 10.2139/ssrn.2666384 [DOI] |
Source: | ECONIS - Online Catalogue of the ZBW |
Persistent link: https://www.econbiz.de/10013014653
Similar items by person
- Unleashing the Power of Public Data for Financial Risk Measurement, Regulation, and Governance
  Hernandez, Mauricio, (2015)
- Vaithyanathan, Shivakumar, (1996)
- Simplifying information integration : object-based flow-of-mappings framework for integration
  Alexe, Bogdan, (2009)