Amazing technological breakthrough possible @S-Logix pro@slogix.in

Office Address

  • #5, First Floor, 4th Street Dr. Subbarayan Nagar Kodambakkam, Chennai-600 024 Landmark : Samiyar Madam
  • pro@slogix.in
  • +91- 81240 01111

Social List

Transfer Learning in Natural Language Processing - 2019

Transfer Learning In Natural Language Processing

Research Area:  Machine Learning

Abstract:

The classic supervised machine learning paradigm is based on learning in isolation, a single predictive model for a task using a single dataset. This approach requires a large number of training examples and performs best for well-defined and narrow tasks. Transfer learning refers to a set of methods that extend this approach by leveraging data from additional domains or tasks to train a model with better generalization properties. Over the last two years, the field of Natural Language Processing (NLP) has witnessed the emergence of several transfer learning methods and architectures which significantly improved upon the state-of-the-art on a wide range of NLP tasks. These improvements together with the wide availability and ease of integration of these methods are reminiscent of the factors that led to the success of pretrained word embeddings and ImageNet pretraining in computer vision, and indicate that these methods will likely become a common tool in the NLP landscape as well as an important research direction. We will present an overview of modern transfer learning methods in NLP, how models are pre-trained, what information the representations they learn capture, and review examples and case studies on how these models can be integrated and adapted in downstream NLP tasks.

Keywords:  

Author(s) Name:  Sebastian Ruder, Matthew E. Peters, Swabha Swayamdipta, Thomas Wolf

Journal name:  

Conferrence name:  Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Tutorials

Publisher name:  Association for Computational Linguistics

DOI:  10.18653/v1/N19-5004

Volume Information: