Rhetorical Structure Theory

Similar documents
Sentence Processing. BCS 152 October

DIRECTORATE-GENERAL III INDUSTRY Legislation and standardization and telematics networks Standardization

Exploiting Cross-Document Relations for Multi-document Evolving Summarization

Glossary of Rhetorical Terms*

COMPUTER ENGINEERING SERIES

CHAPTER 2 REVIEW OF RELATED LITERATURE. advantages the related studies is to provide insight into the statistical methods

Sentence and Expression Level Annotation of Opinions in User-Generated Discourse

OKLAHOMA SUBJECT AREA TESTS (OSAT )

Sentence Processing III. LIGN 170, Lecture 8

The Object Oriented Paradigm

Comparative Rhetorical Analysis

Digital Audio and Video Fidelity. Ken Wacks, Ph.D.

The central or main idea of a nonfiction text is the point the author is making about a topic.

Code : is a set of practices familiar to users of the medium

Undertaking Semiotics. Today. 1. Textual Analysis. What is Textual Analysis? 2/3/2016. Dr Sarah Gibson. 1. Textual Analysis. 2.

TEN FOR TEN. 1. Theater audiences in the 1980 s saw more musical comedies than the 1970 s or 1990 s.

Cedar Rapids Community School District

Rhetorical relations in multimodal documents

Presentation Overview

Copyright, quotations and figures in your report

Formalizing Irony with Doxastic Logic

The ACL Anthology Network Corpus. University of Michigan

ENGLISH 1201: Essays and Prose

What is Character? David Braun. University of Rochester. In "Demonstratives", David Kaplan argues that indexicals and other expressions have a

Using synchronic and diachronic relations for summarizing multiple documents describing evolving events

Sentiment Aggregation using ConceptNet Ontology

Projektseminar: Sentimentanalyse Dozenten: Michael Wiegand und Marc Schulder

A STEP-BY-STEP PROCESS FOR READING AND WRITING CRITICALLY. James Bartell

Correlation to Common Core State Standards Books A-F for Grade 5

Restrictive relative clause constructions as implicit coherence relations

Standard 2: Listening The student shall demonstrate effective listening skills in formal and informal situations to facilitate communication

Influence of lexical markers on the production of contextual factors inducing irony

Dimensions of Argumentation in Social Media

Possible Ramifications for Superiority

Cecil Jones Academy English Fundamentals Map

Writing the Annotated Bibliography for English/World History Synthesis Essay

District of Columbia Standards (Grade 9)

Generation of Video Documentaries from Discourse Structures

Abstracts workshops RaAM 2015 seminar, June, Leiden

Lesson 10 November 10, 2009 BMC Elementary

Chapter III. Research Methodology. A. Research Design. constructed and holistically as stated by Lincoln & Guba (1985).

ILAR Grade 7. September. Reading

Modelling Intellectual Processes: The FRBR - CRM Harmonization. Authors: Martin Doerr and Patrick LeBoeuf

The BBC s Draft Distribution Policy. Consultation Document

NEW MEXICO STATE UNIVERSITY Electrical and Computer Engineering Department. EE162 Digital Circuit Design Fall Lab 5: Latches & Flip-Flops

Honors Ninth Literature and Composition Summer 2017 Reading Assignment

Arkansas Learning Standards (Grade 12)

BBC Distribution Policy June 2018

Building blocks of a legal system. Comments on Summers Preadvies for the Vereniging voor Wijsbegeerte van het Recht

General Educational Development (GED ) Objectives 8 10

Explicit Discourse Connectives Implicit Discourse Relations

Grade 6 Overview texts texts texts fiction nonfiction drama texts author s craft texts revise edit author s craft voice Standard American English

The topic of this Majors Seminar is Relativism how to formulate it, and how to evaluate arguments for and against it.

English II STAAR EOC Review

The Power of Ideas: Milton Friedman s Empirical Methodology

Argumentation and persuasion

Chapter 6. Flip-Flops and Simple Flip-Flop Applications

Identifying functions of citations with CiTalO

Etna Builder - Interactively Building Advanced Graphical Tree Representations of Music

Valuable Particulars

Using Synchronic and Diachronic Relations for Summarizing Multiple Documents Describing Evolving Events

English. Mark Schemes. Cambridge International Primary Achievement Test November 2006

There will be 10 point deducted each day that the project is late. All projects should include the student s name and section!

for Using School to Home Reading for Preschool, Kindergarten, and Primary Children

The identity theory of truth and the realm of reference: where Dodd goes wrong

Bell Ringer. Death is not the greatest loss in life. The greatest loss is what dies inside us while we live. -Norman Cousins

Arkansas Learning Standards (Grade 10)

The Rhetorical Triangle

An Analysis of Puns in The Big Bang Theory Based on Conceptual Blending Theory

Sentiment Analysis. Andrea Esuli

Introduction to Sentiment Analysis. Text Analytics - Andrea Esuli

Language Paper 1 Knowledge Organiser

K-12 ELA Vocabulary (revised June, 2012)

ENGLISH 2201: Essays and Prose

Correlated to: Massachusetts English Language Arts Curriculum Framework with May 2004 Supplement (Grades 5-8)

1. I can identify, analyze, and evaluate the characteristics of short stories and novels.

A User-Oriented Approach to Music Information Retrieval.

Writing paragraphs with topic sentences >>>CLICK HERE<<<

Similarities in Amy Tans Two Kinds

Who Speaks for Whom? Towards Analyzing Opinions in News Editorials

DOES MOVIE SOUNDTRACK MATTER? THE ROLE OF SOUNDTRACK IN PREDICTING MOVIE REVENUE

Cyclic vs. circular argumentation in the Conceptual Metaphor Theory ANDRÁS KERTÉSZ CSILLA RÁKOSI* In: Cognitive Linguistics 20-4 (2009),

COMPUTER ENGINEERING PROGRAM

ISSA Proceedings 2002 Hearing Is Believing: A Perspective- Dependent Account Of The Fallacies

Some Basic Concepts. Highlights of Chapter 1, 2, 3.

Usage of provenance : A Tower of Babel Towards a concept map Position paper for the Life Cycle Seminar, Mountain View, July 10, 2006

Articulating Medieval Logic, by Terence Parsons. Oxford: Oxford University Press,

A Framework for Segmentation of Interview Videos

UNIT SPECIFICATION FOR EXCHANGE AND STUDY ABROAD

Annotated Bibliographies

CPU Bach: An Automatic Chorale Harmonization System

Aligned with Reading Comprehension Skills

Sentiment of two women Sentiment analysis and social media

IN THE MOMENT: he Japanese poetry of Haiku is often introduced to young children as a means

Philosophy of Development

A Theory of Structural Constraints on the Individual s Social Representing? A comment on Jaan Valsiner s (2003) Theory of Enablement

Formalising arguments

BBC Learning English Talk about English Live webcast Politics & Language Thursday November 23 rd, 2006

Intro to Pragmatics (Fox/Menéndez-Benito) 10/12/06. Questions 1

AP English Literature 12 Summer Reading

Transcription:

Domain-Dependent Rhetorical Model Rhetorical Structure Theory Regina Barzilay EECS Department MIT Domain: Scientific Articles Humans exhibit high agreement on the annotation scheme The scheme covers only a small fraction of discourse relations November 2, 2004 Rhetorical Structure Theory 2/26 Domain-Dependent Content Models Domain-Independent Rhetorical Model Capture topics and their distribution Are based on pattern matching techniques Motifs of semantic units Distributional model Useful in generation and summarization Model elements: Binary Relations Compositionality Principle Requirements: Stability and Reproducibility of an Annotation Scheme Expressive Power of a Model Rhetorical Structure Theory 1/26 Rhetorical Structure Theory 3/26

Informational Structure Example of Coherence Relation (1) How many different coherence relations are there? Are different taxonomies of coherence relations compatible with each other? Some real-time evidence for validity of some coherence relations: pronoun experiments (difference cause-effect/resemblance) Causal relations: Cause-Effect effect cause John is dishonest because he is a politician. Rhetorical Structure Theory 4/26 Rhetorical Structure Theory 6/26 Coherence Relations: Historic Perspective Example of Coherence Relation (2) Causal relations: Violated-Expectations John is honest although he is a politician. Aristotle Boccaccio Hume (4th cent. BC) (14th cent.) (18th cent.) John is dishonest Rhetorical Structure Theory 5/26 Rhetorical Structure Theory 7/26

Example of Coherence Relation (3) Example of Coherence Relation (5) Causal relations: Condition If someone is a politician he is dishonest Resemblance relations: Contrast John supported Gore, and Fred cheered for Bush. Rhetorical Structure Theory 8/26 Rhetorical Structure Theory 10/26 Example of Coherence Relation (4) Example of Coherence Relation (6) Resemblance relations: Parallel John organized rallies for Gore, and Fred distributed pamphlets for him. Elaborations relations: John supported Gore, and Fred cheered for Bush. Rhetorical Structure Theory 9/26 Rhetorical Structure Theory 11/26

How many coherence relations? Some accounts of coherence assume 2, other more than 400 coherence relations Hovy&Maier 1995: taxonomies with more relations represent subtypes of taxonomies with fewer relations cause-effect volitional, non-volitional Find Coherence Relations Consider this extract from The Kreutzer Sonata by L. Tolstoy (A) It is amazing how complete is the delusion that beauty is goodness. (B) A handsome woman talks nonsense, you listen and hear not nonsense but cleverness. (C) She says and does horrid things, and you see only charm. (D) And if a handsome woman does not say stupid or horrid things, you at once persuade yourself that she is wonderfully clever and moral. Rhetorical Structure Theory 12/26 Rhetorical Structure Theory 14/26 Problem: Ambiguity Rhetorical Structure Theory (Mann&Thompson:1988, Matthessen&Thompson:1988) Developed in the framework of natural language generation Aims to describe building blocks of text structure Nucleus vs Satellites Binary Relations between Discourse Units Compositionality principle defines how to build a tree from binary relations Rhetorical Structure Theory 13/26 Rhetorical Structure Theory 15/26

Example RST tree [ No matter how much one wants to stay a non-smoker, A ], [ the truth is that the pressure to smoke in junior high is greater than it will be any other time of one s life. B ]. [ We know that 3,000 teens start smoking each day, C ] [ although it is a fact that 90% of them once thought that smoking was something that they ll never do. D ] JUSTIFICATION A B C D JUSTIFICATION CONCESSION Rhetorical Structure Theory 16/26 Rhetorical Structure Theory 18/26 Binary Relations Relations (JUSTIFICATION, A, B) (JUSTIFICATION, D, B) (EVIDENCE, C, B) (CONCESSION, C, D) (RESTATEMENT, D, A) Relation Nucleus Satellite Background text whose understanding is being facilitated Elaboration basic information text whose understanding is being facilitated additional information Preparation text to be presented text which prepares the reader to expect and interpret the text to be presented Rhetorical Structure Theory 17/26 Rhetorical Structure Theory 19/26

Compositionality Automatic Computation of RST Relations Whenever two large text spans are connected through a rhetorical relation, that rhetorical relation holds between the most important parts of the constituent spans. Marcu (1997): used constraint-satisfaction approach to build discourse trees given a set of binary relations Wolf (2004): tree structure is not an adequate representation of discourse structure (Marcu, 1997) Aggregate discourse relations to a few stable groups: (contrast, elaboration, condition, cause-explanation-evidence) Establish deterministic correspondence between cue phrases and discourse relations: { But, However } Contrast { In addition, Moreover } Elaboration Rhetorical Structure Theory 20/26 Rhetorical Structure Theory 22/26 Automatic Computation of RST Relations Accuracy Compared against manually constructed trees (Marcu, 1997; Marcu&Echihabi, 2002) Surface cues for discourse relations: I like vegetables, but I hate tomatoes. Tested against human-constructed trees Automatically constructed trees exhibit high similarity with human-constructed trees However, see (Marcu&Echihabi, 2002) CONTRAST vs ELABORATION: only 61 from 238 have a discourse marker (26%) Rhetorical Structure Theory 21/26 Rhetorical Structure Theory 23/26

Other Words Also Count! Evaluation (Marcu&Echihabi, 2002) Surface cues for discourse relations: I like vegetables, but I hate tomatoes. Training data: Raw 1 billion words corpus (41,147,805 sents) BLIPP parsed corpus (1,796,386 sents) The system can compute accurately some relations (see handout) The size and the quality of the training data matters a lot Rhetorical Structure Theory 24/26 Rhetorical Structure Theory 26/26 Method Assume that certain markers unambiguously predict discourse relations Create Cartesian product of words located on two sides of a discourse marker For each pair of words, compute its likelihood to predict a discourse relation argmax rk P (r k (s 1, s 2 )) = argmax rk P ((s 1, s 2 ) r k ) P (r k ) where s i is a discourse clause, w i is a word and r k is a discourse relation P ((s 1, s 2 ) r k ) = i,j s 1,s 2 P ((w i, w j ) r k ) Rhetorical Structure Theory 25/26