USA voters and NCOA
USA voters and NCOA

Big data project about data standardization, cleansing and indexing for SOLR.

The goal was to create a fast search backend solution for Web UI.

Some facts about the project:

  • About 4 billion records
  • Address and name standardization using IntEngine
  • Source dbs includes: Infutor, Movers, Thrive, NCOA, Spoke
  • .NET 4.5
  • WPF
  • WCF
  • SQL Server 2012
  • Amazon AWS
  • Hadoop
  • SOLR