Part I:
In our previous blog, we explained to you briefly about the structure of patterns and how IBM is poised to assist you with the latest offerings in Watson Discovery. In this blog we will explain to you Pattern Induction and how you can utilize it with ease in IBM Watson Discovery.
Pattern Induction is a human-in-the-loop system that combines the expertise of domain experts with automatic learning capabilities to quickly learn a high-quality extractor. In this system we enable human experts to quickly provide examples and feedback to system suggestions to achieve domain-specific results and high coverage and quality.
Let us walk you through a typical Pattern Induction workflow from the perspective of the user. For the sake of the example our goal is to extract revenue information from financial documents, as discussed in our earlier blog.
Prerequisites: Before starting, please create a Pattern Induction project, by following the few easy steps outlined in the “Try out Pattern Induction” section towards the end of this blog.
STEP 1: Highlight a few examples. Once you completed the prerequisites, start by highlighting a few strings that belong to the pattern you want to extract (see example figure 1 below). Once you have provided enough examples (we recommend at least two for this version of the release), the system will learn the general pattern underlying the provided examples.
Tip: We encourage you to start off with providing two examples and waiting for the system to finish learning before you provide feedback to the learned results and/or directly highlight more examples.
Figure 2: System returns a few suggestions for the user to verify
STEP 3: Wait for a while…. once the system learns an accurate extractor (composed of a small number of patterns) it will inform you accordingly.
Figure 4: User reviews extracted patterns
STEP 5: Saving your pattern. If everything looks correct you can now proceed to the final stage of the process which involves saving the learned patterns for future use. Simply type in a name for your pattern in the top left corner and then click on the “Save pattern” button on the top right corner.
Supplementary section: Try out Pattern Induction
Follow these easy steps to try Pattern Induction:
- Create an IBM account and set up a Watson Discovery project as described below:
Sign up for an IBM account on Watson Discovery and then navigate over to your cloud dashboard: https://cloud.ibm.com. Click on the “Create a resource” button on the top right corner of the screen.
#WatsonDiscovery