Nannonstomus
Nannonstomus

Nannonstomus is a cloud engine that harvests data from online websites and then turns the raw data into any format. With this solution, your business will extract value from complex and voluminous sets of data and will be able to make data-driven decisions leading to the company’s success.

What Challenges We Endured

To bring the Nannonstomus project to life, the Intsurfing team had to find solutions to the following issues:

  • Enormous data volumes - the delivered engine had to deal with massive data chunks by processing a huge number of websites;
  • Processing time - we set to shorten the time for scraping the websites, collecting, wrangling, analyzing, and visualizing data;
  • Protected websites - our team had to deliver a solution capable of harvesting data from captcha- and proxy-protected websites;
  • Scalability - no matter the data volume, the Nannonstomus was supposed to let you process any data sets on demand;
  • Flexibility - our solution had to respond to versatile needs - from in-house implementation for corporate use to hassle-free data delivery in a variety of formats.
Here Is What We Delivered

Nannonstomus offers flexible solutions to businesses that want to yield results from big data. This platform runs on a cloud, giving the advantages of scalability and smooth performance. No matter what data volume you request to process or deliver, Nannonstomus will always have enough resources at hand to fulfill your request. Nannonstomus accesses information from all types of websites, including the ones protected with a captcha and proxy.

Nannonstomus features

Here is what the Nannonstomus big-data platform can do:

  • Data scraping - extracts data from the website and converts it into a more convenient format;
  • Data harvesting - analyzes large data sets to uncover patterns, eliminate duplicates, merge information, etc.;
  • Data wrangling - transforms and maps raw data into more readily used formats for gaining valuable insights;
  • Data analysis - inspects, transforms, and models data to let you discover useful information and make conclusions.

Nannonstomus does the same things as other data extraction services but does it FOUR times faster. With this technology, what once required 180 operational hours will take only 45 hours! Just think of the time and money savings! They are huge with Nannonstomus.

How Nannonstomus Works

To enable flexibility, several Nannonstomus use scenarios are possible:

  1. You make a request and get data in the desired format, in the specified place.
  2. You implement the Nannonstomus in-house and leverage all its advantages to the full at any time, with any number of users.
Tech Stack
  • .NET Core 2.0
  • WPF
  • SQL Server 2016
  • Amazon AWS
  • Tesseract OCR
  • iText PDF