Folktale Description Language
Kotaro Sato
FDL Research Group
Abstract
In this paper, we present a new approach to the structural analysis of folktales. We
apply computers and the Internet to the analysis. We
designed an FDL (Folktale Description Language) and translated Japanese folktales
into FDL. We also developed an OFDA (On-line FDL
Document Analyzer) that searches the FDL document database for documents that
include a specified pattern of sentences. Both
computers and people can understand FDL easily.
The FDL document
consists of "elemental sentences." A combination
of elemental sentences makes up the functions in structural analysis, the motifs
and the types.
The results of
OFDA analysis help us find some interesting characteristics of the folktales,
as well as hints for researching the folktales in the future.
Keywords: Folktale Description Language, Structural Analysis, Function,
Motif, Type, the Internet, World Wide Web, Search System, Computer Aided Translation,
Story Engineering.
1. Introduction
Structural analysis is an important category of folktale research. We
propose the application of computers and the Internet to analyze the structure
of folktales.
Vladimir Propp
[5] and Alan Dundes [2] have showed that folktales have a lot of structural
characteristics. But until now the structural
analysis has been done by handiwork. So it was
inefficient, closed and hard to recycle.
Our approach
consists of following procedure:
(1) Translate original stories into a series of "elemental sentences"
in FDL (Folktale Description Language).
(2) Register these translated stories with the database on the computer network.
(3) Enter the structure you want to find as a search pattern on a web browser,
then you can get matched stories anytime, anywhere.
You can analyze
not only the structures but also the types, the motifs, the styles and the images
of the folktales. We expect FDL technology to
have a wide range of application in interdisciplinary fields too.
2. Structural Analysis of Folktale on Computer
Today, computers cannot understand natural languages, not to mention stories. But
computers are useful tools to analyze the structure of folktales. We
do not need a concrete meanings of the objects, but we need the relationships
between the objects expressed as a sign in structuralism. Computers
are good at searching for symbolic relationships and can do it quickly. Different
from novels or poems, folktales have very few emotional or abstract expressions. Folktales
and computers are well-suited because the folktales are only a set of objective
events.
To analyze the
structure of the folktales by computers, we designed an artificial language
called FDL (Folktale Description Language) with the basic idea of "Transfer
drives story."
For example,
in "Snow White", there are many people moving here and there. Furthermore,
some props (e.g., comb, apple) and information (e.g., "Be careful with
your stepmother.") are moved, too.
In FDL, "elemental
sentence" has the following basic pattern:
Subject (who moves) Verb (how it moves) Object
(what it moves)
where "Verb" means doing some action to an "Object" in a
broader sense.
A combination
of elemental sentences creates the functions in structural analysis, the motifs
and the types. Table 1 is a model of relationships
among them. There are folktale texts (X1, X2 ...
Xn) translated into the combination of the elemental sentences (E1, E2 ... En). For
example, when M1 (motif-1) consists of E1 + E2, X1 and X2 contain M1. When
T1 (type-1) consists of M1 + M2 + M3, X1 and X2 is T1. In
this sample, X2 is both T1 and T2, because X2 also has T2 after T1. On
the other hand, E5 + E6 and E5' + E6' make different motifs (M3, M8) but same
function (F3). A one-to-one correspondence between
motif and function doesn't come generally.
Therefore, "elemental
sentences" are the base of folktale research in various categories. And
we developed a software system called OFDA (On-line FDL Document Analyzer) that
searches the folktale texts in FDL for the matched pattern described as elemental
sentences.
Table 1: A Model of Relationships Among Elemental
Sentences, Functions, Motifs and Types
Scene
|
Scene1 (S1)
|
Scene2 (S2)
|
Scene3 (S3)
|
Scene4 (S4)
|
Scene5 (S5)
|
Scene6 (S6)
|
Motif Formation
|
Type
|
|
Folktale Text
|
X1
(Motif →)
|
E1 E2
|
E3 E4
|
E5 E6
|
|
|
|
|
T1
|
M1
|
M2
|
M3
|
|
|
|
M1+M2+M3
|
|||
X2
(Motif →)
|
E1 E2
|
E3 E4
|
E5 E6
|
E7 E8
|
E9 E10 E11
|
E12
|
|
T1+T2
|
|
M1
|
M2
|
M3
|
M4
|
M5
|
M6
|
M1+M2+M3+M4+M5+M6
|
|||
X3
(Motif →)
|
|
|
|
E7 E8
|
E9 E10 E11
|
E12
|
|
T2
|
|
|
|
|
M4
|
M5
|
M6
|
M4+M5+M6
|
|||
X4
(Motif →)
|
|
|
|
E7 E8
|
E9 E10 E11
|
|
|
T2
|
|
|
|
|
M4
|
M5
|
|
M4+M5
|
|||
X5
(Motif →)
|
E1' E2'
|
|
E5' E6'
|
|
|
E12'
|
|
T3
|
|
M7
|
|
M8
|
|
|
M9
|
M7+M8+M9
|
|||
X6
(Motif →)
|
E2'
|
|
|
E7' E8'
|
E9' E11'
|
|
|
T4
|
|
M7
|
|
|
M10
|
M11
|
|
M7+M10+M11
|
|||
Function
|
F1 F2
|
-
|
F3
|
F4
|
-
|
-
|
-
|
Note: "X" is "Folktale
Text." "E" is "Elemental Sentence." "F"
is "Function." "M" is "Motif." "T"
is "Type."
Motif1 = E1+E2, Motif2 = E3+E4, Motif3 = E5+E6, Motif4 = E7+E8, Motif5 = E9+E10+E11,
Motif6 = E12, Motif7 = E2', Motif8 = E5'+E6', Motif9 = E12', Motif10 = E7'+E8',
Motif11 = E9'+E11'
Function1 = E1 or E1', Function2 = E2 or E2', Function3 = M3 or M8 = (E5+E6)
or (E5'+E6'), Function4 = (M4+M5) or (M10+M11) = (E7+E8+E9+E10+E11) or (E7'+E8'+E9'+E11')
Type1 = M1+M2+M3, Type2 = M4+M5, Type3 = M7+M8+M9, Type4 = M7+M10+M11
"=" means "require" here.
3. Folktale Description Language
3.1 Overview of FDL
FDL (Folktale Description Language) is an artificial language used to describe
the structure of folktales. As a general rule,
one story is copied onto one file, and one event makes up one line. Not
only computers but also people can read FDL easily.
Language, including
FDL, consists of grammar and vocabulary. The grammar
of FDL is almost completed, but the vocabulary of FDL is under construction. So
we are compiling an "FDL dictionary" that includes all words one can
see or use in FDL documents.
We have translated
30 folktales from Japanese Folktales [4] into Japanese FDL (JFDL) as
a test. We found some interests in these FDL documents
by using the OFDA search system. (See below for
further details.)
FDL is ready
for internationalization. For example, to translate
Japanese FDL (JFDL) into English FDL (EFDL), you just need to replace words.
3.2 Sample of FDL
Let us read a famous Japanese folktale "Urashima Taro" in FDL (JFDL,
EFDL).
An FDL document
consists of three parts.
(1) TITLE part includes the title, the translator, the translation date and
so on (The top letter of the line is "#").
(2) INFO part includes the type, the sub-type, the original title, the source
and so on.
(3) STORY part includes the body divided into some motifs.
"{" and "}" expresses the range of the "INFO part"
or the "STORY part."
The following
is a literal translation (line by line) from FDL followed by commentary.
--------
Motif-1 (Scene-1)
EFDL: UrashimaTaro = @sea.near.village
English: Urashima Taro was at a
village near the sea.
Commentary: "=" is a verb
that means the existence or state, similar to "be" in English. "@"
is a preposition similar to "at" in English. "."
is a connective. "A.B" (in EFDL) means
"A B" or "B of A" (in English). For
example, "red.flower" (in EFDL) means "red flower" (in English),
and "table.leg" (in EFDL) means "leg of table" (in English).
EFDL: child** bully turtle
English: Children bullied a turtle.
Commentary: The past tense (e.g.,
"bullied") is expressed in the present tense (e.g., "bully")
in FDL. "*" is a numeral mark. For
example, "daughter*3" (in EFDL) means "3 daughters" (in
English), and "money**" (in EFDL) means "a lot of money"
(in English).
EFDL: UrashimaTaro buy turtle <child**
English: Urashima Taro bought the
turtle from the children.
Commentary: "<" is
a preposition similar to "from" in English.
EFDL: UrashimaTaro release turtle
>sea
English: Urashima Taro released
the turtle to the sea.
Commentary: ">" is
a preposition similar to "to" in English.
Motif-2 (Scene-2)
EFDL: UrashimaTaro fish @sea
English: Urashima Taro fished in
the sea.
EFDL: turtle appear <sea
English: The turtle appeared from
the sea.
EFDL: turtle say >UrashimaTaro
(turtle = Otohime.messenger)
English: The turtle said to Urashima
Taro, "I am Otohime's messenger."
Commentary: When a sentence includes
other sentences, it is enclosed with round brackets. Otohime
is the name of a princess who lives in Ryugujoh, an undersea castle. A
proper noun should be with a capital letter.
EFDL: turtle want (UrashimaTaro
ride >turtle.back)
English: The turtle wanted Urashima
Taro to ride on his back.
EFDL: UrashimaTaro ride >turtle.back
# closing eyes
English: Urashima Taro rode on turtle's
back.
Commentary: "#" is an
annotation mark. You may note an important or
interesting thing without relation to the structure of the story.
It's effective to the end of the line.
EFDL: UrashimaTaro arrive @Ryugujoh
English: Urashima Taro arrived at
Ryugujoh.
Motif-3 (Scene-3)
EFDL: Otohime say >UrashimaTaro
((UrashimaTaro :help turtle) ‾ (Otohime :wait: UrashimaTaro +feast))
English: Otohime said to Urashima
Taro, "I have been waiting for you with a feast because you helped the
turtle."
Commentary: "‾" is a conjunction
mark that shows a strong relation between two sentences. "(sentence_A)
‾ (sentence_B)" (in EFDL) means "sentence_B because sentence_A"
(in English). "+" is a preposition similar
to "with" in English. ":"
is a tense mark. For example, ":verb"
is the past tense, "verb:" is the future tense and ":verb:"
is the progressive form.
EFDL: UrashimaTaro eat feast
English: Urashima Taro ate a feast.
EFDL: UrashimaTaro watch fish**.dance
English: Urashima Taro watched a
dance of fish.
EFDL: $TIME=3.month.during, UrashimaTaro
live @Ryugujoh
English: For three months, Urashima
Taro lived at Ryugujoh.
Commentary: Use "$TIME="
to express the time that includes date, season, etc.
EFDL: UrashimaTaro say >Otohime
(UrashimaTaro want (UrashimaTaro return >village))
English: Urashima Taro said to Otohime,
"I want to return to the village."
EFDL: Otohime give >UrashimaTaro
tamatebako
English: Otohime gave a tamatebako
to Urashima Taro.
Commentary: Tamatebako is an "Urashima's
casket."
EFDL: Otohime say >UrashimaTaro
(UrashimaTaro !permission.open tamatebako)
English: Otohime said to Urashima
Taro, "Don't open the tamatebako."
Commentary: "!" is a negation
mark. "permission" is an auxiliary verb
that means "have a permission to do" in English.
Motif-4 (Scene-4)
EFDL: UrashimaTaro return >village
# ride on turtle's back
English: Urashima Taro returned
to the village.
EFDL: village.state =.change
English: The state of the village
had been changed.
Commentary: "=.verb" expresses
the state.
EFDL: UrashimaTaro ask >oldster
(oldster =.know UrashimaTaro)
English: Urashima Taro asked an
oldster, "Do you know Urashima Taro ?"
EFDL: oldster say >UrashimaTaro
($TIME=300.year.before, UrashimaTaro !:return <sea) # hearsay
English: The oldster said to Urashima
Taro, "Three hundred years ago, he didn't return from the sea."
EFDL: (UrashimaTaro open tamatebako)
‾ (UrashimaTaro become oldMan) # white smoke from tamatebako
English: Because Urashima Taro opened
the tamatebako, he became an old man.
Commentary: A collocation (e.g.,
"old man") is described as one word (e.g., "oldMan").
--------
4. On-line FDL Document Analyzer
4.1 Overview of OFDA
OFDA (On-line FDL Document Analyzer) is software used to analyze folktales in
FDL. Basically, OFDA searches the FDL document
database for documents (story texts) that include the same structure as (or
similar to) your inputted sentences.
However, OFDA
executes a special search. To put it concretely,
when you input plural sentences (e.g., three sentences) as a search pattern,
OFDA searches for the FDL documents that include these sentences in sequence
with or without some unrelated sentences between them. And
OFDA allows you to input some defective sentences or the regular expressions
[3]. These contrivances mean that OFDA can extract
a common structure from different type stories.
4.2 System Architecture of OFDA
OFDA is a client server system on World Wide Web (see Figure 1). There
are folktale databases in FDL and search engine on the web server. Original
CGI script of OFDA is described with Perl script by us. You
can use OFDA through the web browser (e.g., Netscape) from all over the world
(see Figures 2 and 3). OFDA is located on:
http://www.docca.net/mukaken/ofda.html
(Japanese environment required)
Figure 1: System architecture of OFDA.
Figure 2: Snapshot of OFDA (input page).
Figure 3: Snapshot of OFDA (output page).
4.3 Sample of Search and Analysis with OFDA
Explanatory Notes -------------------------------------------------
Input-J: the search pattern in Japanese
FDL (JFDL).
Input-E: the search pattern translated
into English FDL (EFDL).
Output: the result in Japanese and
translated into English.
Commentary: the explanation.
--------------------------------------------------------------------------------
Note: Search targets are 30 folktales in
JFDL from Japanese Folktales [4].
Input-J: 針
Input-E: needle
Output: Three folktales 'Issun Boushi'
(一寸法師), 'Yumemi Kozou' (夢見小僧) and 'Hebi Muko Iri' (蛇婿入り) are hit. OFDA
shows the matched lines in FDL with the line numbers. For
example, in 'Hebi Muko Iri',
In JFDL:
34: 母 命令する >娘 (娘 刺す 針 >若者。着物。裾)
In EFDL:
34: mother command >daughter (daughter stab
needle >youth.clothes.hem)
Commentary: Example of a word search. OFDA
searches for the matched word that appears for the first time in each document. When
you input the following as a search pattern, you get matched stories in which
'needle' appears twice.
In JFDL:
針 《改行》
針
In EFDL:
needle <<return>>
needle
Note: 《改行》
or <<return>> means 'type RETURN or ENTER key on the keyboard.'
The first or the second appearances are important information to analyze the
structure of the story.
Input-J: ”植物
Input-E: "plant
Output: Some plants appear in sixteen
folktales (53%).
Commentary: Example of a word class
search. Double quotation (") is a mark of the
'word class' that means the synonym group. ' "plant
' consists of 'tree, leaf, bud, flower, pine, cryptomeria, willow, mulberry,
peach, cherry, iris, chrysanthemum, camellia, mushroom' (木、葉、芽、花、松、杉、柳、桑、桃、桜、菖蒲、菊、椿、茸)
temporarily. It seems probable that Japanese folktales
have a close relation with plants.
Input-J: ”動物 ”話す
Input-E: "animal "speak
Output: Five folktales 'Hanasaka
Jii' (花咲かじい), 'Hebi Muko Iri' (蛇婿入り), 'Neko no Odori' (猫の踊り), 'Saru
Muko Iri' (猿婿入り) and 'Urashima Taro' (浦島太郎) are hit.
Commentary: Example of a sentence
search. OFDA searches for the story that includes
'some animals spoke something.'
Input-J: ”動物 ”話す 《改行》
娘 なる ”動物。嫁
Input-E: "animal "speak
<<return>>
daughter become "animal.wife
Output: One folktale 'Saru Muko
Iri' (猿婿入り) is hit.
Commentary: Example of a plural
sentences search. There is only one folktale that
includes 'a daughter became animal's wife after an animal spoke.' Such
search pattern is suitable for structural analysis. But
this is a simple sample. You can input more lines
as a search pattern.
Input-J: ”男 ”移動する 《改行》
Motif-2
Input-E: "man "move <<return>>
Motif-2
Output: There are ten folktales
including 'a man moved' in the opening scene (Motif-1).
Commentary: You can use a more complex
search pattern that includes 'Motif-2', etc.
Input-J: 命令する >”男
Input-E: command >"man
Output: There are four folktales
including 'someone commanded a man to do something.'
Input-J: 命令する >”女
Input-E: command >"woman
Output: There is one folktale that
includes 'someone commanded a woman to do something.'
Commentary: It seems likely that
the former is more than the latter in Japanese folktales. To
declare it, of course, more folktales are needed on the database in the future.
5. Conclusion and Future Work
We have presented a new environment with computers and the Internet to analyze
folktales. We designed an artificial language,
FDL, translated Japanese folktales into FDL, and developed an OFDA search system
to analyze the structure of the folktales. We
now get some interesting characteristics of folktales and hints to research
folktales in the future.
We also built
the CAT (Computer Aided Translation) system that includes "JFDL CHECKER"
and "JFDL UPLOADER" as a client server system on World Wide Web. JFDL
CHECKER indicates the errors of vocabulary and spelling of the words in a JFDL
document. JFDL UPLOADER allows translators to
send a JFDL document to the OFDA server from anywhere and anytime, which makes
our translation work more efficient.
We make full
use of the computers and the Internet for all scholars, too. The
results of our research and the databases are open to everyone in the world.
In future work,
we hope to extend our system and research following items:
Our approach is the first step to a new science called "Story Engineering."
A. Appendix
A.1 Additional Mark List
We have explained the marks of FDL in "3.2 Sample of FDL." But
there are more marks in FDL:
"?" is a mark of obscure element. "??"
means the word not narrated explicitly. "?"
makes an interrogative sentence.
(e.g.,) 4.dwarf say (? eat 4.dwarf.dish)
Note: "4.dwarf" means "the
fourth dwarf."
"$" makes "state sentence" and "sub sentence",
which support "main sentence."
(e.g., for "state sentence")
$NATURALENVIRONMENT = rain
$ADVERB=always, $ADVERB=together, (1.merchant & 2.merchant) travel ;business
(e.g., for "sub sentence")
cat say >daughter ($CONDITION=(daughter say >(father & mother) (cat
:(sing & dance))), daughter die)
$SAMETIME=(bushWarbler fly.leave >storehouse.outside), (mansion & storehouse)
disappear
Note: "fly.leave" is a compound
verb that means "fly to leave." But "(sing
& dance)" is not a compound verb in FDL.
"$" makes other elements. "$?"
means preceding "?" (not "??"). "$??"
means preceding "??" (not "?"). "$CIRCUMSTANCES"
means circumstances or particulars.
"," is a punctuation mark connecting "state sentences" or
"sub sentence", and "main sentence."
"%" rewords the character from this point.
(e.g.,) % daughter > wife
"&" is a conjunctive mark similar to "and" in English.
"|" is a conjunctive mark similar to "or" in English.
";" expresses the objective of the sentence.
(e.g.,) dwarf*7 go >mountain ;(dwarf*7 dig (mineral & gold))
"^" is a mark of supplementary explanation of the preceding line.
(e.g.,) ^ (flower** & herb**) = @garden
A.2 Auxiliary Verbs in FDL
"can" means "be able to do."
"!can" means "be not able to do."
"obligation" means "be forced to do."
"!obligation" means "be not forced to do."
"should" means "ought to do."
"!should" means "ought not to do."
"permission" means "have a permission to do."
"!permission" means "have no permission to do."
A.3 Members of FDL Research Group
Nozomi Hachino (Compiler, Translator)
Kogi Kato (Compiler, Translator)
Yuko Matsumura (Compiler, Translator)
Minako Numaga (Compiler, Translator)
Toshio Ozawa (Producer)
Kotaro Sato (Director, Programmer, Translator)
References
[1] Aarne, Antti and Stith Thompson. The Types
of The Folktale. Helsinki: Helsingin Liikekirjapaino,
1961, 1981.
[2] Dundes, Alan. The Morphology of North American
Indian Folktales. Helsinki: Helsingin Liikekirjapaino,
1964.
[3] Friedl, Jeffrey. Mastering Regular Expressions:
Powerful Techniques for Perl and Other Tools. Sebastopol:
O'Reilly, 1997.
[4] Ozawa, Toshio and Suekichi Akaba. Japanese
Folktales. Tokyo: Fukuinkanshoten, 1995.
[5] Propp, Vladimir. Morphology of the Folktale. University
of Texas Press, 1958, 1968.
This paper was originally published in Horizon of Folktale Reserach,
October 1, 2002. And it was based on Images of Folktale 3, March 31,
2000.
to FDL home page (Japanese environment required)
Copyright (c) 2002 Kotaro Sato, FDL Research Group. All rights reserved.