THESIS
2021
1 online resource (xiii, 86 pages) : illustrations (some color)
Abstract
Deep learning models have achieved considerable success in Information Extraction (IE) from text. Such
models usually require a large number of labeled training samples. Since human annotation can be
difficult and time-consuming, automatically generated weak supervision is widely leveraged.
We investigate the creation and the use of weak annotations for IE through two tasks: Aspect and
Opinion Term Extraction (AOTE) and Entity Typing. These tasks correspond, respectively, to the two
kinds of operations an IE system needs to carry out: extracting spans of interest and assigning types to them.
First, we are interested in generating context-dependent weak annotations without much human
effort. For AOTE, we propose an approach to annotating a large number of training samples with
automatic annotation rules. The rules are mined from a small human-labeled sample set, and thus
do not need to be designed manually. For the task of entity typing, we propose an approach that
generates entity type labels by exploiting a pretrained masked language model.
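
As a rough illustration of the masked-language-model idea, a pretrained model can be asked to fill in a type-denoting word next to an entity mention, and its top predictions can serve as weak type labels. The prompt template, model choice, and helper function below are assumptions made for the sketch, not the exact procedure of the thesis:

    # Sketch: weak entity type labels from a pretrained masked language model.
    # Assumptions: Hugging Face transformers is installed; the prompt and the
    # model are illustrative choices, not the thesis's configuration.
    from transformers import pipeline

    fill_mask = pipeline("fill-mask", model="bert-base-cased")

    def weak_type_labels(sentence, mention, top_k=5):
        # Append a type-denoting prompt after the sentence containing the mention.
        prompt = f"{sentence} {mention} is a [MASK]."
        # Each predicted token is treated as a candidate (weak) type label with a score.
        return [(p["token_str"], p["score"]) for p in fill_mask(prompt, top_k=top_k)]

    # Example: candidate labels for "Leonardo da Vinci" might include words such as
    # "painter" or "artist" (illustrative, model-dependent output).
    print(weak_type_labels("Leonardo da Vinci painted the Mona Lisa.", "Leonardo da Vinci"))

In practice the predicted words would still need to be mapped onto the target type vocabulary; the sketch only shows where the weak signal comes from.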
For the use of the generated weak annotations, we consider two settings. In the first setting, only a
set of weakly labeled samples is available; here, we propose to improve the performance of an entity
typing model by leveraging external knowledge. In the second setting, both a set of weakly labeled
samples and a small set of human-annotated samples are available. We show that pretraining neural
models with weak supervision and then fine-tuning them on human-annotated data can yield good results.
Then, for the task of entity typing, we investigate a framework that obtains a better-performing system
by first training multiple models with the weakly labeled data and then stacking them with the help of
a small, high-quality sample set.
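
A minimal sketch of the stacking step, assuming each base model has already been trained on weakly labeled data and exposes a predict_proba interface, and treating typing as single-label classification for simplicity (the meta-classifier choice and the interfaces are assumptions, not the thesis's implementation):

    # Sketch: stack weakly supervised base models with a meta-classifier
    # fitted on a small set of human-annotated samples.
    import numpy as np
    from sklearn.linear_model import LogisticRegression

    def fit_stacker(base_models, clean_inputs, clean_labels):
        # Base-model scores on the high-quality samples become meta-features.
        meta_features = np.hstack([m.predict_proba(clean_inputs) for m in base_models])
        meta_model = LogisticRegression(max_iter=1000)
        meta_model.fit(meta_features, clean_labels)  # fitted on human-annotated labels only
        return meta_model

    def predict_stacked(meta_model, base_models, inputs):
        meta_features = np.hstack([m.predict_proba(inputs) for m in base_models])
        return meta_model.predict(meta_features)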