Improvising hip hop lyrics via transduction grammar induction

HKUST Electronic Theses

Improvising hip hop lyrics via transduction grammar induction

by Addanki Venkata Sai Karteek

THESIS 2015

Ph.D. Computer Science and Engineering

xiv, 118 pages : illustrations ; 30 cm

Abstract

Among the many genres of language that have been studied in computational linguistics and spoken language processing, there has been a dearth of work on lyrics in music, despite the major impact that this form of language has across almost all human cultures. In this thesis, we propose theoretically motivated symbolic and distributed models for improvising lyrics in music and we choose the genre of hip hop lyrics as our domain. Through our work, we model the issues in song lyric improvisation using modern statistical language technologies and attempt to bridge the gap between language and music in natural language processing (NLP).

Firstly, we describe a novel Hidden Markov Model (HMM) based rhyme scheme detection module which identifies the rhyming scheme within a given stanza in a completely unsupervised fashion without using any linguistic or phonetic features. We use this rhyme scheme detection module to select the training data for our improvisation models so as to generate fluent and rhyming output and demonstrate that using the rhyme scheme detection module improves the model performance considerably.

Secondly, we improvise hip hop lyrics by generating responses to challenges similar to a freestyle rap battle. We model the problem of improvisation as a machine translation problem where the challenge needs to be “translated” into a response and train a bottom-up token based bracketing inversion transduction grammar (BITG) model to perform the transduction. We also propose a search heuristic in our decoding algorithm and disfluency handling strategies to improve our model output. We contrast our model with an off-the-shelf phrase based SMT (PBSMT) model and show that our model generates significantly better responses that are more fluent and rhyme better with the challenges.

We also propose a novel model that improvises rhyming and fluent responses for a hip hop lyric challenge by combining both bottom-up token based rule induction and top-down rule segmentation strategies to learn a stochastic transduction grammar. We demonstrate that a smoothed interpolation of rules generated by top-down rule segmentation and bottom-up rule induction outperforms a linear interpolation of the segmental and token based rules. We also show that both our segmental models outperform the token based BITG model on the criterion of generating rhyming and fluent responses to hip hop lyrical challenges. We also show good model performance on Maghrebi French hip hop lyrics demonstrating the language independence of our models.

Another improvisation algorithm using TRAAM, a fully bilingual generalization of Pollack’s (1990) monolingual Recursive Auto-Associative Memory neural network model, in which each distributed vector represents a bilingual constituent is also presented. TRAAM models capture cross-lingual generalizations via soft bilingual categories and hence have attractive properties which can be used for the tasks such as bilingual grammar induction and statistical machine translation approaches. Using a novel pattern completion decoding algorithm, we use a trained TRAAM model to improvise hip hop lyrics.

Lastly, we discuss the challenges in evaluating the performance on the improvisation task of evaluating hip hop lyrics as a first step toward designing robust evaluation strategies for improvisation tasks, a relatively neglected area to date. We discuss our observations regarding inter-evaluator agreement on judging improvisation quality as a means to better understand the high degree of subjectivity at play in improvisation tasks, thereby enabling the design of more discriminative evaluation strategies to drive future model development.

[ Hide abstract ]

View Copyrighted to the author. Reproduction is prohibited without the author’s prior written consent.

Details

Collection HKUST Electronic Theses Degree Ph.D. Department Computer Science and Engineering Supervisors Wu, Dekai Authors Addanki, Venkata Sai Karteek Subjects Computational linguistics Hip-hop Librettos Data processing Language English Call number Thesis CSED 2015 Addank DOI 10.14711/thesis-b1584866

Full record

Improvising hip hop lyrics via transduction grammar induction

by Addanki Venkata Sai Karteek

Post a Comment Cancel reply