A flexible, scaleable approach to the international patent 'name game'
This paper reports a new approach to the disambiguation of large patent databases. Available international patent databases do not identify unique innovators, so record disambiguation poses a significant barrier to subsequent research. Present methods for overcoming this barrier couple ad hoc rules for name harmonisation with labour-intensive manual checking. We present instead a computational approach that requires minimal and easily automated data cleaning, learns appropriate record-matching criteria from minimal human coding, and dynamically addresses both the computational and the data-quality issues that have impeded progress. We show that these methods yield accurate results at rates comparable to outcomes from more resource-intensive hand coding.
Year of publication: | 2014 |
---|---|
Authors: | Huberty, Mark ; Serwaah, Amma ; Zachmann, Georg |
Publisher: | Brussels : Bruegel |
freely available
Series: | Bruegel Working Paper ; 2014/10i |
---|---|
Type of publication: | Book / Working Paper |
Type of publication (narrower categories): | Working Paper |
Language: | English |
Other identifiers: | 798195134 [GVK] hdl:10419/126714 [Handle] |
Persistent link: https://www.econbiz.de/10011420984
Similar items by person
- A scaleable approach to emissions-innovation record linkage. Huberty, Mark (2014)
- A flexible, scaleable approach to the international patent 'name game'. Huberty, Mark (2014)