This tutorial addresses the question:
How would you use some of the Public Combinations in our Haven OnDemand Combinations Marketplace, to process and enrich your own real-world unstructured information?
In this Use Case 2, we explore how you can use Public Combinations on data that has already been formatted and uploaded into a Haven OnDemand Text Index.
Haven OnDemand provides a substantial collection of public datasets. You can use these as correctly formatted samples to set up and try out your applications, before progressing to the step of building your own datasets.
Additionally, the public datasets provide a wealth of information in their own right. We have Wikipedia, archives of scientific papers and patents, a database of transport links, the CIA World Factbook, and... live newsfeeds from leading press sources in four languages.
In this tutorial, we look into the indexing process and examine the public text indexes.
Then, we create a Combination which, beyond searching the newsfeeds on query terms that you supply, enriches them with additional information on the entities, language and sentiment found in them.
We are using:
- The Query Text Index API
- The Enrich group of Public Combinations
- And more specifically, the complex Combination called Enrich Documents Array
See also Use Case 1, in which we cover unformatted data in your file systems.
You must be a registered user to add a comment here. If you've already registered, please log in. If you haven't registered yet, please click login and create a new account.