Index update: Topics
It’s time for another Index update!
Over here at Index HQ we’re always hard at work, plugging away to add useful features for all your tech-scouring needs.
Index runs on news. Every day we push over 1,000 articles through the system’s ever-hungry maw and match them to companies and locations in our database. News is the context layer that informs and enriches the data we gather.
But simple processing isn't enough. The average article has so much more to offer between the headline and author bio than raw data alone. As we continue to find ways to teach Index to read between the lines, we build features that utilize the stories we find there.
Our first step in that direction is topics. Topic detection on Index works through natural language processing and supervised machine learning. We process each news article, determine the main topic of its text, and label the article accordingly.
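For the curious, here is roughly what supervised topic labeling can look like in miniature. This is only a sketch, not the model Index runs: it assumes a simple multinomial Naive Bayes classifier, and the topic names ("funding", "acquisition"), the training headlines, and the `TopicClassifier` class are all made up for illustration.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class TopicClassifier:
    """Hypothetical multinomial Naive Bayes classifier with add-one smoothing."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # topic -> word -> count
        self.doc_counts = Counter()              # topic -> number of articles
        self.vocab = set()                       # every word seen in training

    def train(self, labeled_articles):
        """Learn word statistics from (text, topic) pairs labeled by hand."""
        for text, topic in labeled_articles:
            tokens = tokenize(text)
            self.word_counts[topic].update(tokens)
            self.doc_counts[topic] += 1
            self.vocab.update(tokens)

    def predict(self, text):
        """Return the topic with the highest log-probability for the text."""
        tokens = tokenize(text)
        total_docs = sum(self.doc_counts.values())
        best_topic, best_score = None, -math.inf
        for topic in self.doc_counts:
            # Prior: how common this topic is among the training articles.
            score = math.log(self.doc_counts[topic] / total_docs)
            total_words = sum(self.word_counts[topic].values())
            for token in tokens:
                count = self.word_counts[topic][token]
                # Add-one smoothing keeps unseen words from zeroing the score.
                score += math.log((count + 1) / (total_words + len(self.vocab)))
            if score > best_score:
                best_topic, best_score = topic, score
        return best_topic


# Invented training examples; a real system would need far more data.
TRAINING = [
    ("Startup raises ten million in series A funding round", "funding"),
    ("Investors back fintech company with new venture funding", "funding"),
    ("Tech giant agrees to acquire rival in merger deal", "acquisition"),
    ("Company completes acquisition of analytics startup", "acquisition"),
]

classifier = TopicClassifier()
classifier.train(TRAINING)
label = classifier.predict("Venture investors lead funding round for robotics startup")
print(label)
```

The "supervised" part is the hand-labeled training list: the classifier only knows the topics it has been shown, which is one reason feedback on inaccurate labels is so useful.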
We’re taking a classic iterative approach here, and releasing small updates as we go. For now, topics are purely informative – we’ll be labeling articles in your news streams with their respective topics, but we’re not yet collecting them in one place.
Before we move forward, we want to see what the community thinks of the topics. Which do you find useful, and which seem redundant? Which are spot-on, and which are completely inaccurate?
Index provides insights into private tech companies by turning unstructured content into structured data and intelligence. It is a tool for the tech community, tech companies, journalists, investors and analysts.