From Mnemosyne to Terpsichore - the Bilderatlas after the Image

Similar documents
Modeling memory for melodies

Image Fall 2016 Prof. Mikhail Iampolski

IMIDTM. In Motion Identification. White Paper

Universität Bamberg Angewandte Informatik. Seminar KI: gestern, heute, morgen. We are Humor Beings. Understanding and Predicting visual Humor

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

Expressive performance in music: Mapping acoustic cues onto facial expressions

Enhancing Music Maps

Visual Encoding Design

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

PAINTING CINEMAPH C OT O OGR M APHY IDIGITALCILLUSTRASTIONAMATEUR

Expressive information

Abstracts. Critical Narrative: Art and Memory

Sound visualization through a swarm of fireflies

Tamar Sovran Scientific work 1. The study of meaning My work focuses on the study of meaning and meaning relations. I am interested in the duality of

MUSI-6201 Computational Music Analysis

Bibliometric glossary

CHILDREN S CONCEPTUALISATION OF MUSIC

Classification of Timbre Similarity

Analytic Comparison of Audio Feature Sets using Self-Organising Maps

Kuhn s Notion of Scientific Progress. Christian Damböck Institute Vienna Circle University of Vienna

ASSOCIATIONS BETWEEN MUSICOLOGY AND MUSIC INFORMATION RETRIEVAL

THE USE OF THOMSON REUTERS RESEARCH ANALYTIC RESOURCES IN ACADEMIC PERFORMANCE EVALUATION DR. EVANGELIA A.E.C. LIPITAKIS SEPTEMBER 2014

PAUL REDDING S CONTINENTAL IDEALISM (AND DELEUZE S CONTINUATION OF THE IDEALIST TRADITION) Sean Bowden

Creating a Feature Vector to Identify Similarity between MIDI Files

For m. The numbered artworks referred to in this handout are listed, with links, on the companion website.

PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER

Brief Report. Development of a Measure of Humour Appreciation. Maria P. Y. Chik 1 Department of Education Studies Hong Kong Baptist University

Adaptive Key Frame Selection for Efficient Video Coding

INTER GENRE SIMILARITY MODELLING FOR AUTOMATIC MUSIC GENRE CLASSIFICATION

A Process of the Fusion of Horizons in the Text Interpretation

A p p l i e d I c o n o l o g y

A User-Oriented Approach to Music Information Retrieval.

TOWARDS A USER ORIENTED DESCRIPTION OF COLOUR RENDITION OF LIGHT SOURCES 2.METHODS

Extending Interactive Aural Analysis: Acousmatic Music

Music Information Retrieval with Temporal Features and Timbre

hprints , version 1-1 Oct 2008

Visualizing Euclidean Rhythms Using Tangle Theory

Welcome to My Favorite Human Behavior Hack

UNIVERSAL SPATIAL UP-SCALER WITH NONLINEAR EDGE ENHANCEMENT

The software concept. Try yourself and experience how your processes are significantly simplified. You need. weqube.

White Paper. Uniform Luminance Technology. What s inside? What is non-uniformity and noise in LCDs? Why is it a problem? How is it solved?

Introduction and Overview

Predicting the Importance of Current Papers

Digital Video Telemetry System

Browsing News and Talk Video on a Consumer Electronics Platform Using Face Detection

MUSICAL MOODS: A MASS PARTICIPATION EXPERIMENT FOR AFFECTIVE CLASSIFICATION OF MUSIC

SYNTHESIS FROM MUSICAL INSTRUMENT CHARACTER MAPS

The relationship between shape symmetry and perceived skin condition in male facial attractiveness

Exploring touch: A review of Matthew Fulkerson s The First Sense

GUIDELINES FOR THE CONTRIBUTORS

Recap: Representation. Subtle Skeletal Differences. How do skeletons differ? Target Poses. Reference Poses

Subjective Similarity of Music: Data Collection for Individuality Analysis

Kuhn Formalized. Christian Damböck Institute Vienna Circle University of Vienna

The software concept. Try yourself and experience how your processes are significantly simplified. You need. weqube.

Music Genre Classification

What makes a good logo?

Acoustic and musical foundations of the speech/song illusion

1. MORTALITY AT ADVANCED AGES IN SPAIN MARIA DELS ÀNGELS FELIPE CHECA 1 COL LEGI D ACTUARIS DE CATALUNYA

istarml: Principles and Implications

Automatic Polyphonic Music Composition Using the EMILE and ABL Grammar Inductors *

Vascular. Development of Trinias FPD-Equipped Angiography System. 1. Introduction. MEDICAL NOW No.73 (2013.2) Yoshiaki Miura

ZONE PLATE SIGNALS 525 Lines Standard M/NTSC

An Efficient Reduction of Area in Multistandard Transform Core

UNIVERSITY OF SOUTH ALABAMA PSYCHOLOGY

Development of a wearable communication recorder triggered by voice for opportunistic communication

Speech Recognition and Signal Processing for Broadcast News Transcription

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4

MPEG has been established as an international standard

Performance evaluation of I 3 S on whale shark data

Detecting Musical Key with Supervised Learning

BBM 413 Fundamentals of Image Processing Dec. 11, Erkut Erdem Dept. of Computer Engineering Hacettepe University. Segmentation Part 1

Music Source Separation

Multiple-point simulation of multiple categories Part 1. Testing against multiple truncation of a Gaussian field

Transducers and Sensors

Reading One: Three Couples by Ivy C. Dally

1/9. Descartes on Simple Ideas (2)

A workshop of the mind

Goldie on the Virtues of Art

Design for Information

On the causes of subject-specific citation rates in Web of Science.

Viewer-Adaptive Control of Displayed Content for Digital Signage

1 Overview of MPEG-2 multi-view profile (MVP)

Construction of a harmonic phrase

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY

An Empirical Analysis of Macroscopic Fundamental Diagrams for Sendai Road Networks

Measuring the Facets of Musicality: The Goldsmiths Musical Sophistication Index. Daniel Müllensiefen Goldsmiths, University of London

Music Similarity and Cover Song Identification: The Case of Jazz

Implementation of A Low Cost Motion Detection System Based On Embedded Linux

Research & Development. White Paper WHP 228. Musical Moods: A Mass Participation Experiment for the Affective Classification of Music

Formalizing Irony with Doxastic Logic

A Critical Response to The Psychology of Creativity: A Critical Reading

Week 25 Deconstruction

STANDARDS CONVERSION OF A VIDEOPHONE SIGNAL WITH 313 LINES INTO A TV SIGNAL WITH.625 LINES

Toward the Adoption of Design Concepts in Scoring for Digital Musical Instruments: a Case Study on Affordances and Constraints

hit), and assume that longer incidental sounds (forest noise, water, wind noise) resemble a Gaussian noise distribution.

Visual and Aural: Visualization of Harmony in Music with Colour. Bojan Klemenc, Peter Ciuha, Lovro Šubelj and Marko Bajec

(Skip to step 11 if you are already familiar with connecting to the Tribot)

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks

Usage of provenance : A Tower of Babel Towards a concept map Position paper for the Life Cycle Seminar, Mountain View, July 10, 2006

Transcription:

From Mnemosyne to Terpsichore - the Bilderatlas after the Image 1. Aby Warburg s Atlas of Images This study concerns Aby Warburg s last and most ambitious project: the Atlas Mnemosyne (or Bilderatlas), conceived in August 1926 and truncated three years later, unfinished, by Warburg s sudden death in October 1929. Mnemosyne consists of a series of large black panels, on which are attached a variable number of black-and-white photographs of paintings, sculptures, tarot cards, stamps, coins, and other types of images. The version we use is the one Warburg was working on at the time of his death, also known as the 1-79 version : it includes 1 around a thousand images pinned on 63 panels. The Bilderatlas is a conceptual maze - the culmination of a life s scholarship in images and memory - through which perhaps the clearest thread is the concept of Pathosformel, or formula for (the expression of) passion. Much excellent work has been written on the concept but, to the best of our knowledge, nobody has yet tried to operationalise it - to turn it into a sequence of quantitative operations, or in other words, into an algorithm (Moretti, 2013). 1 This version of the Atlas is published in various print editions, and available online at www.engramma.it/eos2/atlante

2. The Pathosformel and its Operationalisation On the most basic level, the Pathosformel describes the portrayal passionate emotions through a formula, a repeatable visual paradigm. The Pathosformel owes much of its force, as Salvatore Settis has pointed out, to its combination of semantic opposites: an oxymoronic word, in that it merges in the same term the movement of pathos and the rigidity of the formula-schema (Settis 1997). Rather than attempting to operationalise the entire concept at once, we first break it down into its constituent components: first the morphology of formula, then the dynamism of pathos. We turn to two well-studied cases of Warburgian formula: the Death of Orpheus (where Warburg first named the concept of Pathosformel), and the Nymph, headhuntress and Fortuna (Bilderatlas panels 46-48). Looking at Warburg s examples of the Orpheus-formula, reproduced in Figure 1, we can hardly stop ourselves from spotting a formula which repeats across the ages; but how could this intuitive similarity be measured? Our algorithm comes as follows: 1. We isolate each individual body from its context It is clear the Pathosformel relates to individual characters - the object of study thus becomes not panels or photographs in the Bilderatlas, but individual bodies. 2. We take only the skeletons of such bodies Here we are eliminating colour, clothes, hands, faces, gender, age. This is not to say that such factors aren t important: but they are not elementary to 2 the formula 3. We compare these skeletons by measuring the angles of the main limbs of the body, as described in Section 5. 2 See e.g. André Jolles letter to Warburg 23rd December 1900, where the formula of Ghirlandaio s Nymph hops between Judith, Salome, Tobias, Gabriel etc. (Ghelardi 2004, p. 248), or the formula in Bilderatlas Panel 47 shared between Giambologna s Samson and Donatello s Judith.

Each of these steps is not merely a convenient quantification, but a conceptual wager. This is the strength of operationalisation as a critical tool: it forces oneto be explicit about the conceptual choices one makes. Fig 1: the Death of Orpheus: details from Warburg s example figures from classical antiquity to Dürer. Reproduced from Warburg 1905/1999 3. Anatomy, Emotion and Pose Warburg was certainly influenced bydarwin stheexpression oftheemotions inman and Animals (Darwin 1872) - which, when talking about human emotions, largely concentrates on the face3. Warburg was certainly struck by the book - writing in his diary ein Buch, das mir hilft! (Gombrich, p.72). He was also interested and capable of studying the face in art4, yet never in relation to Pathos - his descriptions of the Pathosformel relate exclusively to the body. We can relate Warburg s decision to the large psychological literature on emotional recognition from bodies. Psychological studies are based on the Light Spots Model by Johansson (1973), often called biological motion, in which reducing body pose to 10-12 points - quite comparable to ourownreduction -is judged to give a compelling impression ofhuman walking, running, dancing etc.. Using only Light Spots, observers can reliably tell gender and emotion from dynamic pose (Kozlowski 1977, Montepare 1987). Indeed, it has been suggested that our emotional understanding of faces is more influenced by our perception of the body than vice versa (Van den Stock 2007). 3 Indeed, the first figure of the book is an anatomical diagram of the face - I shall often have to refer [...] to the muscles of the human face (Darwin 1872 p.23). 4 See e.g. his discussion of faces in Ghirlandaio s Confirmation in Santa Trinita, in The Art of Portraiture in the Florentine Bourgeoisie (in Warburg 1999 p.185)

4. Encoding Pathos through Pose in the Bilderatlas The Atlas is, even by today s dataset standards, quite sizeable: 1000 images across 63 panels, containing an order of 10 3-10 4 depicted human figures. Scalable manual annotation is only therefore possible through crowdsourcing, which we did through the CrowdFlower platform. Accurately annotating every visible figure in an image is a difficult and ambiguous tasks. Additionally, if different workers annotate different figures in the same image, the annotations cannot be collated or averaged. We therefore developed a two-stage annotation process: I. Human figures are extracted from the painting by drawing bounding-boxes. This is done three times per image (by three separate workers). II. Having aggregated the information from the first stage, separate images are produced for each human figure. Detailed pose information (the position of major body-points) is then added by three separate workers, with the information aggregated. It should be clarified that the decision to annotate bodies in isolation (for annotation accuracy and just worker compensation) is quite separate to the earlier conceptual decision to analyse bodies individually, which relates to the object of study. It would be quite possible to do either one without the other. Using this two-stage annotation process, we have presently annotated ⅓ of the Bilderatlas (by panels), resulting in 1,665 aggregated human poses. The collection and aggregation of the data are described in greater technical detail elsewhere (Impett, 2016).

5. Data Analysis: dimensionality reduction and dimensioned reductionism Having encoded our static poses, how do we analyse and compare a collection of human figures of different sizes, proportions and orientations? We mirrored the poses horizontally and controlled for global rotation, ending up with a 11-dimensional vector P, describing the angles of the main limbs. From this angular pose vector, we can use circular statistics to find a morphological distance D a,b between two poses P a and P b : Where P a, i is the i th angle of pose vector P a, and α is the angular radian distance: These morphological pose-differences are perceptually meaningful over short distances. On a larger scale, they become less perceptually significant: is a sitting person closer to a lying or standing person? In order to make our distance analysis perceptually relevant, therefore, we first clustered our 1,665 poses into 16 clusters by rotational K-means clustering (Dordet-Berdanet and Wicker 2008). Our two-stage clustering system is therefore as follows: I. K-means clustering (to produce meaningful clusters) II. Hierarchical clustering (for within-cluster morphological information) The number of clusters K is chosen by looking at the inter-cluster variance over K. The result of the first stage of clustering is shown in Figure 2; Figure 3 shows an example section from a hierarchical map of Cluster 1.

Fig 2: our 16 pose-clusters Fig 3: a detail from a dendrogram of different poses within the Bilderatlas, produced by second-stage clustering within Cluster 2

6. Unity of the Pathosformeln: from distant to close reading Some of the clusters clearly represent physical activities - sitting, praying, embracing, dancing - whilst others seem more subtly communicative or expressive in nature. Having reorganised the 1,665 figures into 16 mean-centred clusters, we proceeded to trace the classical Pathosformeln - identified in the Atlas by primary and secondary literature - through our clusters. The canonical Pathosformeln are mainly mythological figures (Perseus, Pentheus, Orpheus) or recurring allegories (Graces, Nymphs, Fortuna). They were previously described as distinct, and we expected to find a taxonomy of such formulae through our clustering analysis. On the contrary, the statistical result was much stronger: a complete morphological unity. Almost every identified Pathosformel falls into Cluster 1, with few false positives - over 80% of the figures in Cluster 1 are an identified Pathosformel. The handful of exceptions are all borderline cases, placed in peripheral to Cluster 1 (Clusters 7 and 13). Looking more closely at the images themselves, as in Figure 4, this becomes visually clear: not only do the Pathosformeln share certain pose features (most importantly, a raised arm) present nowhere else in the dataset. To date, however, the authors know of no art-historical literature that has identified such morphological unity.

Figure 4, clockwise from top-left: Laocoön, Orpheus, Fortuna, Nymph, Judith, Perseus - all except Orpheus are in the Atlas.Theidentified Pathosformeln share distinguishing features from the other characters in the Atlas: a raised arm, most often accompanied by a lowered second arm, and a slight twist of the body.

7. Concluding remarks Our morphological model for Pathosformel is statistically strong: but what are the art-historical implications? The oppositional symmetry and raised arm of Cluster 1 (Fig. 2) reminds us of a Contrapposto, but the bodies themselves are far removed from such classical balance (e.g. Fig. 4, top). Rather than movement, tension (between upper and lower body) seems to be the fundamental element of Pathosformel - the nature of which will be the subject of a subsequent publication. Our morphological analysis has shown that static pose can identify Pathosformeln, and that a study of static pose through a large collection of artistic works can identify links across styles, periods and cultures. Automatic detection of pose is a focus of current research, and will allow us to expand our art-historical models beyond the Bilderatlas. Humans (thus bodies, and poses) are unsurprisingly the most common feature of human art, and therefore make excellent objects of study for an art history of the Longue Durée (Robb 2015).

Bibliography Darwin, C. 1872. The expression of the emotions in man and animals. London, John Murray Dortet-Bernadet, J.L. and Wicker, N., 2008. Model-based clustering on the unit sphere with an illustration using gene expression profiles. In Biostatistics, 9(1), pp.66-80. Gombrich, E.H. 1970. Aby Warburg, an intellectual biography. Chicago University Press. Impett, L. and Süsstrunk, S., 2016. Pose and Pathosformel in Aby Warburg s Bilderatlas. In European Conference on Computer Vision (pp. 888-902). Springer International Publishing. Johansson, G., 1973. Visual perception of biological motion and a model for its analysis. In Perception & psychophysics, 14(2), pp.201-211. Kozlowski, L.T. and Cutting, J.E., 1977. Recognizing the sex of a walker from a dynamic point-light display. In Perception & Psychophysics, 21(6), pp.575-580. Montepare, J.M., Goldstein, S.B. and Clausen, A., 1987. The identificationof emotions from gait information. In Journal of Nonverbal Behavior, 11(1), pp.33-42. Moretti, F., 2013. Operationalizing ; or, The Function of Measurement in Modern Literary. Stanford Literary Lab Pamphlet Series. litlab.stanford. edu/literarylabpamphlet6 Robb, J. and Harris, O.J.T. 2013. Body worlds and their history: some working concepts. The Body in History: Europe from the Palaeolithic to the future : 7(31). Settis, Pathos und Ethos, Morphologie und Funktion, in W. Kemp, G. Mattenklott, M. Wagner, M. Warnke, eds, Vorträge aus dem Warburg-Haus, Band 1, De Gruyter, Berlin 1997, pp. 39-44. Van den Stock, J., Righart, R. and De Gelder, B., 2007. Body expressions influence recognition of emotions in the face and voice. In Emotion, 7(3), p.487. Warburg, A., trans. D. Britt. 1999 The Renewal of Pagan Antiquity. Los Angeles, Getty Research Institute.