AN EXAMPLE FOR NATURAL LANGUAGE UNDERSTANDING AND THE AI PROBLEMS IT RAISES

Similar documents
mcs 2015/5/18 1:43 page 15 #23

In Defense of the Contingently Nonconcrete

What is Character? David Braun. University of Rochester. In "Demonstratives", David Kaplan argues that indexicals and other expressions have a

UNIT 7 A CLOSE SHAVE. Script [Track 45]

1/8. The Third Paralogism and the Transcendental Unity of Apperception

SYSTEM-PURPOSE METHOD: THEORETICAL AND PRACTICAL ASPECTS Ramil Dursunov PhD in Law University of Fribourg, Faculty of Law ABSTRACT INTRODUCTION

Check back at the NCTM site for additional notes and tasks next week.

Scientific Philosophy

1/6. The Anticipations of Perception

On Recanati s Mental Files

Aristotle s Metaphysics

Replies to the Critics

INTERNATIONAL CONFERENCE ON ENGINEERING DESIGN ICED 05 MELBOURNE, AUGUST 15-18, 2005 GENERAL DESIGN THEORY AND GENETIC EPISTEMOLOGY

Philosophy 405: Knowledge, Truth and Mathematics Spring Russell Marcus Hamilton College

Manuel Bremer University Lecturer, Philosophy Department, University of Düsseldorf, Germany

Sidestepping the holes of holism

Introduction p. 1 The Elements of an Argument p. 1 Deduction and Induction p. 5 Deductive Argument Forms p. 7 Truth and Validity p. 8 Soundness p.

Conceptions and Context as a Fundament for the Representation of Knowledge Artifacts

WRITING SHORT REPORTS AND LEADS. Chapter 4 All the News: Writing and Reporting for Convergent Media Multimedia Reporting Course

ENGLISH PAPER 1 (LANGUAGE)

Philosophy Department Expanded Course Descriptions Fall, 2007

Is Hegel s Logic Logical?

Reply to Stalnaker. Timothy Williamson. In Models and Reality, Robert Stalnaker responds to the tensions discerned in Modal Logic

Appendix B. Elements of Style for Proofs

MIMes and MeRMAids: On the possibility of computeraided interpretation

Mathematical Principles of Fuzzy Logic

Conditional Probability and Bayes

AskDrCallahan Calculus 1 Teacher s Guide

The Square of Opposition: Innovations in Teaching Logic

INTRODUCTION TO AXIOMATIC SET THEORY

MITOCW ocw f07-lec02_300k

Consistency and Completeness of OMEGA, a Logic for Knowledge Representation

Criterion A: Understanding knowledge issues

CINEMA SIX Emergency and Safety Procedures

Computing, Artificial Intelligence, and Music. A History and Exploration of Current Research. Josh Everist CS 427 5/12/05

Musical Creativity. Jukka Toivanen Introduction to Computational Creativity Dept. of Computer Science University of Helsinki

Elements of Style. Anders O.F. Hendrickson

Vagueness & Pragmatics

Crime and Punishment. Before you read Work with a partner. Have you read a newspaper today? What was the headline?

MATH 195: Gödel, Escher, and Bach (Spring 2001) Notes and Study Questions for Tuesday, March 20

THE PARADOX OF ANALYSIS

Nissim Francez: Proof-theoretic Semantics College Publications, London, 2015, xx+415 pages

Phenomenology Glossary

Informatics Enlightened Station 1 Sunflower

Review of Krzysztof Brzechczyn, Idealization XIII: Modeling in History

Peirce's Remarkable Rules of Inference

PART II METHODOLOGY: PROBABILITY AND UTILITY

ENGLISH FILE. 3 Grammar, Vocabulary, and Pronunciation A. 1 Complete the sentences with the correct passive

A Functional Representation of Fuzzy Preferences

cse371/mat371 LOGIC Professor Anita Wasilewska

Plato s work in the philosophy of mathematics contains a variety of influential claims and arguments.

CONTINGENCY AND TIME. Gal YEHEZKEL

SECTION EIGHT THROUGH TWELVE

Incommensurability and Partial Reference

Foundations in Data Semantics. Chapter 4

A Sherlock Holmes story The Norwood Builder by Sir Arthur Conan Doyle Chapter 1

that would join theoretical philosophy (metaphysics) and practical philosophy (ethics)?

1/10. The A-Deduction

Louis Althusser s Centrism

Ontology as a formal one. The language of ontology as the ontology itself: the zero-level language

FINAL EXAMINATION Semester 3 / Year 2010

Naïve realism without disjunctivism about experience

English Language Lesson two Dr. S. Fiala

Internal Realism. Manuel Bremer University Lecturer, Philosophy Department, University of Düsseldorf, Germany

6 Bodily Sensations as an Obstacle for Representationism

Action, Criticism & Theory for Music Education

1/8. Axioms of Intuition

Active Voice vs. Passive Voice (#25. Grammar Girl here. Today's topic is active voice versus passive voice. Here's Brian from Iowa:

BBC LEARNING ENGLISH Jamaica Inn 5: Lost on the moor

Learning to Listen.. and Defusing a Hostile Situation. Course Outline

Introduction Section 1: Logic. The basic purpose is to learn some elementary logic.

Moral Judgment and Emotions

Partitioning a Proof: An Exploratory Study on Undergraduates Comprehension of Proofs

For every sentences A and B, there is a sentence: A B,

This is a vocabulary test. Please select the option a, b, c, or d which has the closest meaning to the word in bold.

Is Genetic Epistemology of Any Interest for Semiotics?

SocioBrains THE INTEGRATED APPROACH TO THE STUDY OF ART

138 Great Problems in Philosophy and Physics - Solved? Chapter 11. Meaning. This chapter on the web informationphilosopher.com/knowledge/meaning

Resemblance Nominalism: A Solution to the Problem of Universals. GONZALO RODRIGUEZ-PEREYRA. Oxford: Clarendon Press, Pp. xii, 238.

Logic and Philosophy of Science (LPS)

Introduction: A Musico-Logical Offering

Aristotle s Modal Syllogistic. Marko Malink. Cambridge Harvard University Press, Pp X $ 45,95 (hardback). ISBN:

English as a Second Language Podcast ENGLISH CAFÉ 131

Physics 277:Special Topics Medieval Arms and Armor. Fall Dr. Martin John Madsen Department of Physics Wabash College

Reading. Source: Breakingnewsenglish.com

Etna Builder - Interactively Building Advanced Graphical Tree Representations of Music

PHI 3240: Philosophy of Art

Chudnoff on the Awareness of Abstract Objects 1

Take a Break, Bach! Let Machine Learning Harmonize That Chorale For You. Chris Lewis Stanford University

The second disease is very common: there are many books that violate the principle of having something to say by trying to say too many things.

Designing a Deductive Foundation System

February 16, 2007 Menéndez-Benito. Challenges/ Problems for Carlson 1977

The Language Revolution Russell Marcus Fall Class #7 Final Thoughts on Frege on Sense and Reference

11. SUMMARY OF THE BASIC QUANTIFIER TRANSLATION PATTERNS SO FAR EXAMINED

1/ 19 2/17 3/23 4/23 5/18 Total/100. Please do not write in the spaces above.

Non-Classical Logics. Viorica Sofronie-Stokkermans Winter Semester 2012/2013

Mixing Metaphors. Mark G. Lee and John A. Barnden

12th Grade Language Arts Pacing Guide SLEs in red are the 2007 ELA Framework Revisions.

Unit 10: rules and regulation

Critical Analytical Response to Literature: Paragraph Writing Structure

Transcription:

AN EXAMPLE FOR NATURAL LANGUAGE UNDERSTANDING AND THE AI PROBLEMS IT RAISES John McCarthy Computer Science Department Stanford University Stanford, CA 94305 jmc@cs.stanford.edu http://www-formal.stanford.edu/jmc/ 1976 1 THE STORY The following story from the New York Times is my candidate for a target for a natural language understander. The story is about a real world event, and therefore the intentions of the author are less relevant for answering questions than for made up stories. The main goal of this discussion is to say what a person who has understood the story knows about the event. This seems to me to be preliminary to making programs that can understand. A 61-year old furniture salesman was pushed down the shaft of a freight elevator yesterday in his downtown Brooklyn store by two robbers while a third attempted to crush him with the elevator car because they were dissatisfied with the $1,200 they had forced him to give them. The buffer springs at the bottom of the shaft prevented the car from crushing the salesman, John J. Hug, after he was pushed from the first floor to the basement. The car stopped about 12 inches above him as he flattened himself at the bottom of the pit. 1

Mr. Hug was pinned in the shaft for about half an hour until his cries attracted the attention of a porter. The store at 340 Livingston Street is part of the Seaman s Quality Furniture chain. Mr. Hug was removed by members of the Police Emergency Squad and taken to Long Island College Hospital. He was badly shaken, but after being treated for scrapes of his left arm and for a spinal injury was released and went home. He lives at 62-01 69th Lane, Maspeth, Queens. He has worked for seven years at the store, on the corner of Nevins Street, and this was the fourth time he had been held up in the store. The last time was about one year ago, when his right arm was slashed by a knife-wielding robber. An intelligent person or program should be able to answer the following questions based on the information in the story: 1. Who was in the store when the events began? Probably Mr. Hug alone. although the robbers might have been waiting for him, but if so, this would have probably been stated. What did the porter say to the robbers? Nothing, because the robbers left before he came. 2. Who was in the store during the attempt to kill Mr. Hug? Mr. Hug and the robbers. 3. Who had the money at the end? The robbers. 4. Is Mr. Hug alive today? Yes, unless something else has happened to him. 5. How did Mr. Hug get hurt? Probably when he hit the bottom of the shaft. 6. Where is Mr. Hug s home? (A question whose answer requires a literal understanding of only one sentence of the story.) Does Mr. Hug live in Brooklyn? No, he lives in Queens. 7. What are the names and addresses of the robbers? This information is not available. 8. Was Mr. Hug conscious after the robbers left? Yes, he cried out and his cries were heard. 9. What would have happened if Mr. Hug had not flattened himself at the bottom of the pit? What would have happened if there were no buffer springs? Mr. Hug would have been crushed. 10. Did Mr. Hug want to be crushed? No. 11. Did the robbers tell Mr. Hug their names? No. 2

12. Were the robbers present when the porter came? No. 13. Did Mr. Hug like the robbers, and did they like him? 14. Why did the robbers leave without killing Mr. Hug? Perhaps, they thought they had killed him, and perhaps their anger was appeased by the actions they had performed, and perhaps they had taken all the time they dared, and perhaps something specific happened to frighten them away. 15. What would have happened if Mr. Hug had tried to run away? Perhaps he would have succeeded, but more likely they would have injured or killed him since probably they had weapons, and there were three of them. 16. What can Mr. Hug do to avoid this in the future? No solution is entirely satisfactory. He could carry a gun or he could quit or he could get his employers to install an alarm system or maybe he will be lucky. 17. Did Mr. Hug know he was going to be robbed? Does he know that he was robbed? 18. Was Mr. Hug s right arm slashed before his left arm was scratched? Yes, because the former was a year ago. 19. How did the robber try to crush him with the car? By pressing the buttons or operating the control lever to make the car go to the bottom of the shaft. 20. Why did Mr. Hug yell from the bottom of the elevator shaft? So as to attract the attention of someone who would rescue him. 21. How long did the events take? More than half an hour but less than a day. Most of the time was spent by Mr. Hug filling out forms in the hospital. 22. What crimes were committed? This question has the advantage that it is one that is normally answered on the basis of such a story, since the police report of the incident was probably the basis of the New York Times story. Robbery, possibly assault with a deadly weapon, and attempted murder are the more obvious crimes. One might specifically challenge natural language systems to answer this question. The above list of questions is rather random. I doubt that it covers all facets of understanding the story. It would be worthwhile to try to make up a list of questions that does cover substantially all aspects of the story in order to get as complete as possible an intuitive idea of what capabilities are involved in understanding such a story. Note that the story is about a real event so that such a question as what does the J in John J. Hug stand for has an answer. In a made-up story, questions about middle names or what year the story occurred in do not necessarily have an answer, and an intelligent person or program would 3

know that too. 2 ARTIFICIAL NATURAL LANGUAGE I think that artificial intelligence is not very close to being able to understand such stories in a genuine way. Therefore, I would like to sneak up on it gradually by dividing the problem into parts which can be attacked separately. Here are some of the components: 1. A formalism capable of expressing the assertions of the sentences free from dependence on the grammar of the English language. A good test for such a formalism would be to produce a program for translating from the formalism into any of several natural languages. More weakly, it should be as easy for a human to translate from the formalism into a natural language as to translate from one known natural language to another. Let s call this formalism an artificial natural language ANL for short. The grammar of ANL should be trivial and mathematical in character. There would be an English version in which English words were used as identifiers, but there would still have to be a glossary that gives the precise meaning of the identifiers. There would also be a German and a Japanese version. The translation from the English version to the German or Japanese version would be a simple substitution for identifiers, and a German or Japanese who had learned the grammar could then translate into his language with the aid of the German or Japanese glossary. This idea has some resemblance to the idea of deep structure, but I have some doubts about whether either idea is well enough defined to say. 2. A data structure for expressing the facts (apart from expressing the sentences). In such a data structure, it would be definite which robber pushed Mr. Hug first, and what the robbers said even though it is not stated in the story. Clearly some compromise is necessary here, since the data structure need not be able to express positions and velocities of molecules. Like the PLANNER languages, as Robert Moore has characterized them in his 1976 MIT Master s thesis, the descriptions would contain no disjunctions, and might be a collection of relations with constants as arguments where every relation not asserted (in a certain class) is automatically denied. Alternatively, the basis of this data structure might be various networks of nodes described by sentences in the predicate calculus. Some of the sentences would assert that certain programs applied to the data structures would 4

answer certain questions. When such sentences existed, reasoning would include the operation of the programs. In this way, we would expect to avoid the extreme prolixity that arises when we attempt to do even simple calculations by pure predicate calculus deduction. The test of success for the data structure would be that a human could readily formally deduce the answers to the above questions using a proof checker. Most of the proof-checker would be straightforward, but there is a major problem concerned with when it is possible to jump to a conclusion. 3. I see each of the following problems as a difficult AI problem: a. A parser that takes English into ANL. b. An understander that constructs the facts from a text in the ANL. c. Expression of the general information about the world that could allow getting the answers to the questions by formal reasoning from the facts and the general information. The general information would also contain non-sentence data structures and procedures, but the sentences would tell what goals can be achieved by running the procedures. In this way, we would get the best of the sentential and procedural representations of knowledge. d. A problem solver that could answer the above questions on the basis of the facts. We imagine the questions to be expressed in the fact language and expect the answers in the fact language, i.e. we avoid grammar problems in both understanding the questions and in expressing the answers. 3 THE USE OF LOGICAL FORMULAS When my understander has digested the story of Mr. Hug, it will have added one or more predicate calculus sentences to its data base. One sentence will do if it has the form ep1p2g1g2e1e2... ( event(e) person(p1) name(p1) = JohnJ.Hug g1 Robbers...). In this form, all the entities involved in expressing the facts of the story are existentially quantified variables. The only constants in the formula would have been present in the system previously. However, it is probably better to use a collection of sentences introducing a collection of individual constants. In this case, there will be 20 or so new individual constants 5

representing people, groups of people, the main event and its sub-events, places, organizations, etc. In representing the robbers, the system has a choice of representing them by three individual constants, R1, R2, and R3 or by using a single symbol G1 to represent the group of robbers. A good system will probably use both. If the number of robbers were not specified, we would have to use a constant for the group. We have to identify the robber who operated the elevator while the others pushed Mr. Hug into the shaft. We shall call him R1. The other two are not discriminated in the story, but there is no harm in our calling them R2 and R3, even if there is no information to discriminate them. If there were 20 robbers, it would be a mistake to give them all individual names. Suppose it had further been stated that as the robbers left one of them threatened to return and kill Mr. Hug later but that it was not stated whether this robber was the same one who operated the elevator. We could designate this robber by R4, but we would not have sentences asserting that R4 was distinct from R1, R2 and R3; instead we would have a sentence asserting that R4 was one of these. It is tempting to identify the group of robbers with the set R1, R2, R3, but we may want to give the group some properties not enjoyed by the set of its members. Sentences with plural subjects express some rather tricky concepts. Thus, the group robbed the store, and this is not an assertion that each member robbed the store. The members of the police emergency squad presents a similar problem. We don t want to assert how many there were. In this connection, it may be worthwhile to distinguish between what happened and what we wish to assert about what happened. A language adequate to describe what happened would not have to leave the number of policemen present vague and could give them each a name. In my old jargon, such a language would be metaphysically adequate though not epistemologically adequate. Devising a language that is only metaphysically adequate may be a worthwhile stage on the way to an epistemologically adequate system. By devising a language I mean defining a collection of predicate and constant symbols and axiomatizing their general properties. This language should not be peculiar to the story of Mr. Hug, but we should not require that it be completely general in the present state of the science. It is not obvious how to express what we know when we are told that Mr. Hug is a furniture salesman. A direct approach is to define an abstract entity called F urniture and a function called salesmen and to assert Hug salesmen(f urniture). 6

This will probably work although the logical connection between the abstract entity F urniture and concrete chairs and tables needs to be worked out. It would be over-simplified to identify F urniture with the set of furniture in existence at that time, because one could be a salesman of space shuttles even though there don t exist any yet. In my opinion, one should resist a tendency to apply Occam s razor prematurely. Perhaps we can identify the abstract F urniture with the an extension of the predicate that tells us whether an object should be regarded as a piece of furniture, perhaps not. It does no harm to keep them separate for the time being. This case looks like an argument for using second order logic so that the argument of salesmen could be the predicate furniture that tells whether an object is a piece of furniture. However, there are various techniques for getting the same result without the use of second order logic. 4 THE NEED FOR NONMONOTONIC REA- SONING After reading the story, one is prepared to answer negatively the question of whether there was someone else besides Mr. Hug and the robbers present. However, sentences describing such another person could be added to the story without contradiction. Our basis for the negative answer is that we can construct a model of the facts stated in the story without such a person, and we are applying Occam s razor in order to not multiply entities beyond necessity. This could be attributed to the fact that the New York Times tells the whole story when it can, but I think that by putting Occam s razor into the system, we can get this result without having to formalize the New York Times. This suggests introducing the notion of the minimal completion of a story expressed in the predicate calculus. The minimal completion of the story is also a set of sentences in the predicate calculus, but it contains sentences asserting things like The set of people in the store while the robbers were trying to crush Mr. Hug consists of Mr. Hug and the robbers. These sentences are to be obtained from the original set by the application of a process formalizing Occam s razor. This process works from a set of sentences and is not logical deduction although it might be accomplished by deduction in a meta-language that contained sentences about sets of sentences. As I have 7

pointed out elsewhere, the process cannot be deduction, because it generates sentences that contradict sentences that are consistent with the original set of sentences. A number of the questions given in the previous section have answers that can be formally deduced from the minimal completion but not from the original list. It has been suggested that probabilistic reasoning should be used to exclude the presence of other people rather than Occam s razor. The problem with this is that the number of additional entities that are not logically excluded is limited only by one s imagination so that it is not clear how one could construct a probabilistic model that took these possibilities into account only to exclude them as improbable. If one wants to introduce probabilities, it might make more sense to assign a probability to the correctness of the minimal completion of a New York Times story based on its past record in finding the relevant facts of robberies. Another problem in constructing the completion is the isolation of the story from the rest of the world. The members of the Police Emergency Squad all have mothers (living or dead), but we don t want to bring them into the completion not to speak of bringing in more remote ancestors all of whom can be asserted to exist on the basis of axioms about people. 5 CONCLUSION To recapitulate: The original set of predicate calculus sentences can be generated from the story as one goes along. Each sentence is generated approximately from a sentence of the story with the aid of general knowledge and what has been generated from the previous sentences. (This will usually be the case if the story is well told although there are sometimes cases in which the correct way to express a sentence will depend on what follows - but this is not good writing.) The completion, however, will depend on the whole of the story. It might be interesting to consider what can be determined from a partial reading of the story even stopping the reading in the middle of a sentence since what has appeared so far in a sentence often must be understood in order to even parse the re... /@steam.stanford.edu:/u/ftp/jmc/mrhug.tex: begun 1996 May 14, latexed 1996 May 14 at 9:31 a.m. 8