Rebecca C. Steorts
August 30, 2016
How do we cope with duplicated administrative records?
How do we cope with duplicated medical records?
How do we cope with duplicated deaths in the Syrian conflict?
Record linkage joins multiple databases (without unique identifiers) to remove duplicated entities.
Record linkage is also known as entity resolution or coreference resolution.
Record linkage: merging more than one database to remove duplicated entities.
De-duplication: removing duplicated entities from one database.
Which inventor records from the US Patent & Trademark Office (USPTO) database correspond to the same unique individuals?
USPTO: 8 million patents, multiple inventors per patent
Literature is broken up into two classes mainly (supervised and unsupervised methods)
Throughout the course the goals of the course are: