AI startup that indexes the entire web - provides Diffbot Knowledge Graph (DKG) query services in Google Sheets and Microsoft Excel to instantly enrich existing user databases with comprehensive and publicly available information about companies and organizations. The main challenge was to build the best extraction API to process pages found by Crawlbot
The following features were identified for functionality expansion:
Our team joined in 2020 with 2 fullstack developers.
Our team members mainly worked with Java and Cloud frameworkes
Number of our contractors:
1 Back end
1 Front end
1 QA testing
Usually 1-2 people were simultaneously attached to the project. Our key responsobilies was to create side infrastructure stuff.
Distributed, world-class crawling infrastructure processing millions of pages daily.
Plug-and-play scraping and Knowledge Graph access
Clean structured data (such as JSON or CSV), ready for use
No spam, only relevant and useful articles
Congratulations!
You will now be the first to know the latest news and receive useful articles.