Author(s) Name:   James Pustejovsky, Amber Stubbs
Create your own natural language training corpus for machine learning. Whether youre working with English, Chinese, or any other natural language, this hands-on book guides you through a proven annotation development cycle—the process of adding metadata to your training corpus to help ML algorithms work more efficiently. You dont need any programming or linguistics experience to get started.
Table of Contents
1. The Basics
2. Defining Your Goal and Dataset
3. Corpus Analytics
4. Building Your Model and Specification
5. Applying and Adopting Annotation Standards
6. Annotation and Adjudication
7. Training: Machine Learning
8. Testing and Evaluation
9. Revising and Reporting
10. Annotation: TimeML
11. Automatic Annotation: Generating TimeML
12. Afterword: The Future of Annotation
ISBN:  9781449306663
Publisher:  O Reilly Media, Inc Publisher
Year of Publication:  2012
Book Link:  Home Page Url