Language Technologies in Humanities: Computational Semantic Analysis in Folkloristics. Gregor Strle, GNI ZRC SAZU; Matija Marolt, UL FRI

Transcription:

Language Technologies in Humanities: Computational Semantic Analysis in Folkloristics
Gregor Strle, GNI ZRC SAZU
Matija Marolt, UL FRI
JT DH, 29. 9. 2016

Folk Song Lyrics
Can we analyze lyrics and infer:
- song type (e.g. love, moral, legendary, drinking)
- relations between songs
Melodies in oral traditions are often borrowed and transferred between songs.
Candidate song types shown on the slide: love? moral? legendary? death? drinking? family?

Goal
Three experiments on a corpus of Slovenian folk song lyrics:
- Can we discover the topics and conceptual structure of songs?
- Can we classify or group songs according to the topics they describe?

Corpus
- Newly created from the books Slovenske ljudske pesmi I-V, ZRC SAZU (1970-2007), by scanning and OCR
- 4095 Slovenian folk narrative poems from the 18th century on
- 349 variants, with 1 to 180 songs per variant

Conversion
Separate lyrics and metadata

Conversion
1. Replacement rules: symbols characteristic of dialect groups (semivowels, diphthongization, pitch accent, etc.) are replaced by their grammatical equivalents.
2. A dialect dictionary (>18000 words/forms) is used to translate the words into the literary language.
3. Obeliks, a morphosyntactic tagger for the Slovenian language, is used for lemmatization: it tags the words with morphological features and provides lemmas.
Example word forms shown on the slide: bešta, bǝt, beteg, tecita, biti, bolečin
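The three conversion steps can be sketched as a small pipeline. This is only an illustration under stated assumptions, not the authors' implementation: the replacement rules, the dictionary entries, and the lemma table are toy stand-ins, and the dialect-to-literary pairings are one possible reading of the flattened slide example. The `lemmatize` parameter is a hypothetical hook where a real tagger such as Obeliks would be plugged in; no Obeliks API is used or implied here.

```python
import re

# Step 1: illustrative replacement rules (assumed, not the authors' rule set).
REPLACEMENT_RULES = [
    (re.compile("[\u01dd\u0259]"), "e"),  # schwa-like symbols written as plain "e"
    (re.compile("[áà]"), "a"),            # accent marks dropped from "a"
    (re.compile("[éè]"), "e"),            # accent marks dropped from "e"
]

# Step 2: toy dialect dictionary (dialect form -> literary form).
# The pairings are an assumed reading of the slide example.
DIALECT_DICT = {"bet": "biti", "bešta": "tecita", "beteg": "bolečina"}

# Step 3: toy lemma table standing in for a real lemmatizer.
TOY_LEMMAS = {"tecita": "teči", "biti": "biti", "bolečina": "bolečina"}


def apply_rules(token):
    """Apply every replacement rule to a single token, in order."""
    for pattern, replacement in REPLACEMENT_RULES:
        token = pattern.sub(replacement, token)
    return token


def normalize(text, lemmatize=None):
    """Run rules, dictionary lookup, and lemmatization on each token."""
    if lemmatize is None:
        lemmatize = lambda word: TOY_LEMMAS.get(word, word)
    lemmas = []
    for token in re.findall(r"\w+", text.lower()):
        token = apply_rules(token)               # step 1
        token = DIALECT_DICT.get(token, token)   # step 2
        lemmas.append(lemmatize(token))          # step 3
    return lemmas
```

With these toy tables, `normalize("bešta b\u01ddt beteg")` returns `['teči', 'biti', 'bolečina']`. Words missing from the tables pass through unchanged, so gaps in the dictionary do not break the pipeline.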

Experiments
Narrow context, just 2 song families:
- love and fate conflicts
- family fates and conflicts
Themes related to death, murder, suicide, infidelity, and punishment, e.g.:
- Death of a bride before wedding
- Nun's suicide for love
- Unfaithful student
- Poisoning of own sister
Strong intertextuality: verses, motifs, and thematic patterns travel from one song to another.

Experiment one
LSA:
- not as good in detecting heterogeneity (three variant types detected)
- the resulting semantic space generalizes towards the most salient aspects of the corpus
LDA:
- can associate topics with different variant types
- more even distribution across topics

LSA: variant types and dimensions
DEATH OF A BRIDE BEFORE WEDDING
  d1: mother child young baby shepherd wreath blood
  d4: Ljubljana linden lover boy seduce chamber Tonček
  d5: Breda Ljubljana groom mother-in-law linden baby Turk
  d6: Breda accident evil house mother-in-law sister groom
  d8: Ljubljana brother linden sea shirt prefer wash lover
NUN'S SUICIDE FOR LOVE
  d2: convent Ursula nun baptism godmother ring blood
  d3: convent Ursula nun baptism godmother shepherd wreath
HUNTER SHOOTS HIS LOVER AND HIMSELF
  d7: newpriest grave bury church rifle hunter student
  d9: Ljubljana linden rifle grave hunter shaking leaves
  d10: rifle hunter shaking Tonček leaves face pale

LDA: variant types and topics
DEATH AT A REUNION
  t1: heart boy Breda head sad hunter Danube
MURDER OUT OF JEALOUSY
  t2: love sword kneel sharp neighbor boyfriend blame
BRIDE INFANTICIDE
  t3: home shepherd Mary uncle birth shred rockcradle
UNFAITHFUL STUDENT/NEW PRIEST
  t4: undertaker love priest parish love promise letter
NUN'S SUICIDE FOR LOVE
  t5: love Uršika convent boy Jesus farewell sword
REJECTED LOVER
  t6: seduce blood house Vida linden Ljubljanians death
WIDOWER ON BRIDE'S GRAVE
  t7: tender abandon blood bread jesus rockcradle married
ABANDONED ORPHANS
  t8: bury window chamber wound grow crying dead
PUNISHMENT FOR THE WICKED SONS AND DAUGHTERS-IN-LAW
  t9: gold sea mountain rooster fear crying darling son
MISTRESS' LOYALTY REPAID
  t10: boy fenced heart nosegay dead grieve loyal

[Figure: LSA and LDA panels. The Voronoi diagram represents topological projections of both methods.]
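To make the LDA side of this comparison concrete, here is a minimal sketch of LDA fitted by collapsed Gibbs sampling, using only the Python standard library. It is an assumption-laden illustration: the slides do not say which LDA implementation, hyperparameters, or number of iterations were used in this experiment, and the tiny `DOCS` corpus is made up from a few words in the lists above. LSA is not shown.

```python
import random

# Toy lemmatized "songs", built from a few words on the slide (illustrative only).
DOCS = [
    ["convent", "nun", "ursula", "ring", "godmother"],
    ["convent", "nun", "baptism", "godmother", "blood"],
    ["rifle", "hunter", "grave", "leaves"],
    ["rifle", "hunter", "shaking", "leaves", "pale"],
]


def lda_gibbs(docs, n_topics, n_iter=200, alpha=0.1, beta=0.01, seed=0):
    """Fit LDA with collapsed Gibbs sampling.

    Returns (theta, nkw): per-document topic distributions and
    per-topic word counts.
    """
    rng = random.Random(seed)
    vocab_size = len({w for doc in docs for w in doc})
    ndk = [[0] * n_topics for _ in docs]      # document-topic counts
    nkw = [{} for _ in range(n_topics)]       # topic-word counts
    nk = [0] * n_topics                       # tokens per topic
    z = []                                    # topic of every token
    for d, doc in enumerate(docs):            # random initial assignment
        z.append([rng.randrange(n_topics) for _ in doc])
        for w, k in zip(doc, z[d]):
            ndk[d][k] += 1
            nkw[k][w] = nkw[k].get(w, 0) + 1
            nk[k] += 1
    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                k = z[d][i]                   # remove the token's current topic
                ndk[d][k] -= 1
                nkw[k][w] -= 1
                nk[k] -= 1
                weights = [
                    (ndk[d][t] + alpha) * (nkw[t].get(w, 0) + beta)
                    / (nk[t] + vocab_size * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                z[d][i] = k                   # record the resampled topic
                ndk[d][k] += 1
                nkw[k][w] = nkw[k].get(w, 0) + 1
                nk[k] += 1
    theta = [
        [(count + alpha) / (len(doc) + n_topics * alpha) for count in ndk[d]]
        for d, doc in enumerate(docs)
    ]
    return theta, nkw


def top_words(nkw, topic, n=5):
    """Most frequent words currently assigned to one topic."""
    ranked = sorted(nkw[topic].items(), key=lambda item: -item[1])
    return [word for word, count in ranked[:n] if count > 0]
```

Calling `theta, nkw = lda_gibbs(DOCS, n_topics=2)` gives one topic distribution per song in `theta`, and `top_words(nkw, 0)` lists the words that dominate the first topic, which is the kind of word list shown for t1 to t10 above.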

Experiment two
Do LDA topics correspond to song families?
- Can we distinguish between "love and fate conflicts" and "family fates and conflicts"?
- Difficulty: intertextuality; themes in both families are similar.
Method: agglomerative hierarchical clustering of variant types according to the similarity of their average topic distributions.
Result:
- The semantic space does include some notion of song families.
- It enables us to place individual (also new or unknown) songs into this space and study their relations to existing materials.

Family clusters 1 (2:6) and 4 (13:31): hunter earth unfortunately rifle son mother remember noble castle son stand cry dress letter dress give mother wife children find gold adultery measure colorful stick boy mountain will water mother hero angry dam girlfriend mother-in-law brother father house dear ours sister see tender live leave quickly name call barely crown world beg
Love clusters 2 (17:11) and 3 (6:4): field three maid sun golden like ark sea lover things husband voice eat say young white know sin school mistress unlock boy saint window pot die lie stepmother run home getup graveyard rough get out go home
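The clustering step can be sketched as follows: average the topic distributions of all songs in a variant type, then merge the closest variant types bottom-up. The slides name agglomerative hierarchical clustering but not the distance or linkage, so Euclidean distance and average linkage are assumptions here, as are the function names and the toy numbers in the usage note.

```python
from math import sqrt


def average_by_type(song_topics, song_types):
    """Average the topic distributions of all songs in each variant type."""
    grouped = {}
    for song, dist in song_topics.items():
        grouped.setdefault(song_types[song], []).append(dist)
    return {
        vtype: [sum(col) / len(dists) for col in zip(*dists)]
        for vtype, dists in grouped.items()
    }


def euclidean(a, b):
    """Euclidean distance between two topic distributions (assumed metric)."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def average_linkage(c1, c2, points):
    """Mean pairwise distance between members of two clusters."""
    total = sum(euclidean(points[a], points[b]) for a in c1 for b in c2)
    return total / (len(c1) * len(c2))


def agglomerate(points, n_clusters):
    """Merge the two closest clusters until n_clusters remain."""
    clusters = [[name] for name in points]
    while len(clusters) > n_clusters:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                dist = average_linkage(clusters[i], clusters[j], points)
                if best is None or dist < best[0]:
                    best = (dist, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters
```

For example, with made-up two-topic distributions `{"s1": [0.9, 0.1], "s2": [0.8, 0.2], "s3": [0.1, 0.9], "s4": [0.2, 0.8]}` and variant types `{"s1": "A", "s2": "B", "s3": "C", "s4": "D"}`, `agglomerate(average_by_type(...), 2)` groups A with B and C with D. A new song can be placed in the same space by comparing its topic distribution with the cluster members.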

Experiment three
Can LDA detect the major themes characteristic of individual variant types?
Supervised learning with Labeled LDA (LLDA):
- predefined labels for topical distributions
- LLDA learns topic distributions for the labels
Procedure:
- Manually annotated selected variants with labels (18% of the corpus)
- Trained the model
- Inference on the entire corpus yields distributions over labels for each song
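The idea behind Labeled LDA can be sketched with the standard library alone: during training each token may only be assigned to one of its song's annotated labels, and at inference time every label is allowed while the learned label-word counts stay fixed. This is a simplified illustration under assumptions, not the authors' setup; the function names, hyperparameters, and the toy labels in the usage note are all hypothetical.

```python
import random


def train_llda(docs, doc_labels, labels, n_iter=100, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs training; a token can only take a label of its document."""
    rng = random.Random(seed)
    vocab_size = len({w for doc in docs for w in doc})
    nkw = {lab: {} for lab in labels}                  # label-word counts
    nk = {lab: 0 for lab in labels}                    # tokens per label
    ndk = [{lab: 0 for lab in allowed} for allowed in doc_labels]
    z = []
    for d, doc in enumerate(docs):                     # random start within allowed labels
        z.append([rng.choice(doc_labels[d]) for _ in doc])
        for w, lab in zip(doc, z[d]):
            ndk[d][lab] += 1
            nkw[lab][w] = nkw[lab].get(w, 0) + 1
            nk[lab] += 1
    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            allowed = doc_labels[d]
            for i, w in enumerate(doc):
                old = z[d][i]
                ndk[d][old] -= 1
                nkw[old][w] -= 1
                nk[old] -= 1
                weights = [
                    (ndk[d][lab] + alpha) * (nkw[lab].get(w, 0) + beta)
                    / (nk[lab] + vocab_size * beta)
                    for lab in allowed
                ]
                new = rng.choices(allowed, weights=weights)[0]
                z[d][i] = new
                ndk[d][new] += 1
                nkw[new][w] = nkw[new].get(w, 0) + 1
                nk[new] += 1
    return nkw, nk, vocab_size


def infer_labels(doc, nkw, nk, vocab_size, labels, n_iter=100,
                 alpha=0.1, beta=0.01, seed=0):
    """Distribution over all labels for one song, with label-word counts fixed."""
    rng = random.Random(seed)
    counts = {lab: 0 for lab in labels}
    z = [rng.choice(labels) for _ in doc]
    for lab in z:
        counts[lab] += 1
    for _ in range(n_iter):
        for i, w in enumerate(doc):
            counts[z[i]] -= 1
            weights = [
                (counts[lab] + alpha) * (nkw[lab].get(w, 0) + beta)
                / (nk[lab] + vocab_size * beta)
                for lab in labels
            ]
            z[i] = rng.choices(labels, weights=weights)[0]
            counts[z[i]] += 1
    total = len(doc) + alpha * len(labels)
    return {lab: (counts[lab] + alpha) / total for lab in labels}
```

A usage pattern matching the procedure above: call `train_llda` on the annotated subset, with each song's labels given as a list such as `["nun"]` or `["nun", "hunter"]`, then call `infer_labels` on every song in the corpus. The label with the highest probability plays the role of the song's main topic.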

Experiment three
Most variants share multiple topics, with the main topic for each shown as most salient, e.g. "Mother prevents her son's marriage".
Disambiguation of similar topics (e.g. unhappy love)

Side project - TextExplore
Enable non-programmers to experiment with topic models:
- import corpus
- create topic models (Mallet)
- visualize documents, topics, time, location

Conclusion
LDA:
- can uncover typical characteristics of individual variant types
- enables classification of unknown materials
- helps discover relationships (similarities and differences) in the corpus
Future work:
- more song families
- further develop visualization and exploration
- relations between lyric and melodic spaces