Beagle helps you identify keywords, phrases, regexes, and complex search queries of interest in streams of text documents.
Accelerated Text is a natural language generation tool which takes descriptions of your data and then produces multiple versions of those descriptions varying in wording and text structure.
Easily crawl news portals or blog sites using Storm Crawler.
Example: crawl 10,000,000 web pages per day and make them available for enterprise search.
Example: given a list of websites of investment funds, determine the geographic make up of their exposure.
Example: index 500,000 quarterly reports, then determine what is important to rank in the top 10 for each query of interest.
Example: identifying market reactions to fluctuations of commodity prices as manifested in popular media.
Example: Retrieve auditor details from a repository of quarterly company reports.
Example: automatically generate monthly employee performance reports for different stakeholders.
NLP pipeline with crawler and venture capital funding event detection.
NLP library used as part of Weborama's media monitoring package.
Custom company web page crawl to extract information about business activities.
NLP pipeline with crawler, job advertisement identification and contact person recognition.
NLP pipeline with crawler. Event detection related to financial instruments. Timeseries database population.
Crawler, named entity recognition, text classification, clustering, deduplication, text similarity estimation and sentiment analysis.
NLP pipeline with web and social media crawler, named entity recognition, sentiment analysis and article classification.
Open source word stemmer and page function identification algorithm. Research into customer care messages classification.