110-item wordlists for the Italic-Romance group (Indo-European family).
Languages included: Archaic Latin; Late Classical Latin; Megleno Romanian; Istro Romanian;
Aromanian; Romanian; Dalmatian; Friulian; Gardenese Ladin; Fassano Ladin; Rumantsch
Grischun; Sursilvan Romansh; Surmiran Romansh; Vallader Romansh; Lanzo Torinese Piemontese;
Barbania Piemontese; Carmagnola Piemontese; Turinese Piemontese; Vercellese Piemontese;
Bergamo Lombard; Plesio Lombard; Ravennate Romagnol; Ferrarese Emiliano; Carpigiano
Emiliano; Reggiano Emiliano; Rapallo Ligurian; Genoese Ligurian; Stella Ligurian; Venice
Venetian; Primiero Venetian; Bellunese Venetian; Italian, Old; Standard Italian; Grosseto Tuscan
Italian; Foligno Italian; Neapolitan; Logudorese; Campidanese; Palermitan Sicilian; Messinese
Sicilian; Catanian Sicilian; South-Eastern Sicilian; Central Catalan; North-Western Catalan;
Minorcan Catalan; Castelló de la Plana Catalan; Valencia Catalan; Manises Catalan; Castilian
Spanish; Asturian; Standard Portuguese; Galician; Provençal Occitan; Savoyard Franco-Provençal;
French, Old; Standard French; Picard; Walloon.
Latin.
The Archaic Classical Latin list is based on the plays of Titus Maccius Plautus (c. 254184 BC, born
in Sarsina) plays. In difficlult cases the list is supported by data from "De agri cultura" by Marcus
Porcius Cato (234149 BC, born in Tusculum).
The Late Classical Latin list is based on works by Apuleius (c. 124 c. 170 AD, born in Madaurus,
Numidia): "Metamorphoses", "Apologia", "Florida", "De dogmate Platonis", "De mundo". (It should
be noted that Apuleius mother tongue was probably Phoenician or Greek, but he learned Latin at a
very young age and mastered it to a very high level while living in Rome.) In some particularly
complex cases we have also quoted material from Aulus Gellius (c. 125 after 180 AD, born in
Rome) "Attic Nights", Petronius (c. 27 66 AD, born in Massalia (Marseille)) "Satyricon", and
Vulgata (translated by St. Jerome, c. 347 420 AD, born near Emona, Venetia et Histria).
Abbreviations: Amph. = Amphitruo; Asin. = Asinaria; Aul. = Aulularia; Bac. = Bacchides; Capt. =
Captivi; Cas. = Casina; Cist. = Cistellaria; Curc. = Curculio; Epid = Epidicus; Men. = Menaechmi;
Merc. = Mercator; Mil. = Miles gloriosus; Most. = Mostellaria; Poen. = Poenulus; Pseud. =
Pseudolus; Rud. = Rudens; Trin. = Trinummus; Truc. = Truculentus; Vid. = Vidularia.
Megleno Romanian.
The village named Oșani in Capidan’s works bore the name Ossiani or Xifiani in Greek. On
6/6/1925 it was renamed to Archangelos The village Lugunţa bore the name Lunguntsa and was
renamed to Lankadia / Langadia on 2/3/1926. The village Ţârnareca bore the name Tserna Reka and
was renamed to Karpi on 1/11/1926.
All the verbs are given in the first-person singular present tense form, as in [Capidan 1935].
Capidans transcription is transliterated as follows:
[Capidan 1935]
MosLex
b
b
p
p
v
v
f
f
d
d
t
t
dz
ʒ
ts
c
z
z
s
s
ǯ
č
j
ž
ș
š
gi
ȡ
chi
ȶ
g
g
c
k
h
x
m
m
n
n
ń
ɲ
r
r
l
l
ľ
ʎ
ł
ɫ
u
w
i
y
i
i
e
e
ę
ɛ
ă
ə
a
a
ɐ
ǫ
ɔ
u
u
ea
e
a
oa
o
a
V
V
, V
ˈV
Istro Romanian.
The Istro Romanian language has two main dialects: one spoken in Žejane and the other spoken in
all the rest of the villages located to the South of Žejane. Žejane is separated from the southern
villages by Croatian-speaking territory. There are two mismatches between the Žejane dialect and
the southern dialect (far, leaf) plus three additional mismatches caused by loanwords in one of
the dialects (belly, dog and rain).
The available sources cover both dialects, and the main source, Kovačec’s dictionary, marks all the
forms precisely with the name of the village where they were written down.
This list is based primarily on the Southern dialect, but Žejane forms are given as well.
The names of Istro Romanian villages are given in their Croatian form, the correspondences
between the Croatian and Istro Romanian names are: Brdo = Bârdo, Jesenovik = Sucodru, Kostrčan
= Costârčan, Nova Vas = Nosolo / Noselo, Šušnjevica = Susńeviţę / Șușńeviţę, Žejane = Žeiân.
The following transliterational chart covers our principal sources:
Byhan 1899
Glavina 1905
Kovačec 2010
Modified Croatian Orthography
MosLex
b
b
b
b
b
p
p
p
p
p
v, w
v
v
v
v
f
f
f
f
f
d
d
d
d
d
t
t
t
t
t
ts
ţ
ţ
c
c
dz
d
dz
ʒ
z
z
z
z
z
s
s
s
s
s
c before i and
e; cĭ, ci
č
č
č
ǧ
đ
ǯ
ẑ, ž
j
ž
ž
ž
ŝ, š
ș
ș
š
š
ć, t″
ć
ȶ
g
g
g
g
g
k
c, ch before i
and e
c, k before i
and e
k
k
ɣ, g
ɣ
g
ɣ
h
h
h
h
x
m
m
m
m
m
n
n
n
n
n
ń
ń
nj
ɲ
ñ
n
n, ŋ
n
ŋ
r
r
r
r
r
l
l
l
l
l
ľ
ľ
lj
ʎ
i
ĭ
i
j
y
i
i
i
i
i
e
e
e
e
e
ę
ea
ę
ę
ä
o
î, ă
â
â
ə
a
a
a
a
a
ǫ
å
å
ɒ
o
o
o
o
o
u
u
u
u
u
V
V
V
ˈV
1. Sârbu and Frăţilă use the same transcription as Kovačec, the only exception is j instead of ž.
2. The sound g has shifted to ɣ in the Žejane dialect in almost all positions, with the exception of
clusters ŋg and gʎ [Narumov 2001: 659], so they must be regarded as allophones of one phoneme.
This is why there is no difference between g and ɣ in so-called Modified Croatian Orthography.
Aromanian
All the verbs are given in the first-person singular present tense form.
The following transliterational chart covers our principal sources:
Dalametra
Papahagi
Bara et al.
Cunia
MosLex
b
b
b
b
b
p
p
p
p
p
v
v
v
v
v
f
f
f
f
f
δ
δ
δ
dh
ð
θ
θ
θ
th
θ
d
d
d
d
d
t
t
t
t
t
ţ
ţ
ţ
ts
c
dz
dz
d
dz
ʒ
z
z
z
z
z
s
s
s
s
s
c before e, i
č
č
c before e, i
č
g before e, i
ǧ
ǧ
g before e, i
ǯ
j
j
ž
j
ž
ș
ș
ș
sh
š
gh before e, i
ϟ
g
gh before e, i
ȡ
ch before e, i
k
k
ch before e, i
ȶ
h
h
h
h
ʆ
ɣ
ɣ
ɣ’
y
ʑ
g
g
g
g
g
c
c
k
c
k
ɣ
ɣ
ɣ
gh
ɣ
h
h
h
h
x
m
m
m
m
m
n
n
n
n
n
n
n
nj
ɲ
r
r
r
r
r
l
l
l
l
l
ľ
ľ
lj
ʎ
ĭ
ĭ
ĭ
i
y
ŭ
ŭ
u
u
w
i
i
i
i
i
e
e
e
e
e
î
î
-
-
ɨ
ă
ă
ā
ã
ə
a
a
a
a
a
o
o
o
o
o
u
u
u
u
u
ea
ea
ea
ea
e
a
oa
oa
oa
oa
o
a
-
-; ŭ after dz, j, ţ, s
u
-
ŭ
-
-; ĭ
i
-
ĭ
-
-
ā
-
ə
V
V
ˈV
-
ˈV
V. Romanian
According to [Lukht & Narumov 2001: 575], the Romanian and Moldavian literary languages are
identical. We have tried to verify this statement by constructing a separate Swadesh list for
Moldavian, based on two large Russian-Moldavian dictionaries [Podiko 1973; Borsh & Zaporozhan
1990]. As may be seen from the listing in the notes section, they do indeed coincide on all 110
points, so there is no reason to separate these idioms from the lexicostatistical point of view.
The following transliterational chart covers our principal sources:
Modern Romanian orthography
Moldavian Cyrillic orthography
MosLex
b
б
b
p
п
p
v
в
v
f
ф
f
d
д
d
t
т
t
ţ
ц
c
z
з
z
s
с
s
c before e, i
ч
č
g before e, i
ӂ
ǯ
j
ж
ž
ș
ш
š
g, gh before e, i
г
g
k, ch before e, i
к
k
h
х
h
m
м
m
n
н
n
l
л
l
r
р
r
i
й
y
u
у
w
i
и
i
e
е
e
î, â
ы
ɨ
ă
э
ə
a
а
a
o
о
o
u
у
u
ea
я
e
a
oa
оа
o
a
Note: In word-final position, consonants before i become palatalized, in which case i is not
pronounced.
VI. Dalmatian
The majority of Dalmatian data are written down after Antonio Udina, the last native speaker of
Dalmatian. In making this list, we tried to use mainly Udinas contexts, but if it was necessary, we
also added information from other sources. Prior to being interviewed by researchers, Udina had not
used Dalmatian for a few decades, so his Dalmatian is heavily influenced by Venetian. Taking this
into account, we do not regard the Dalmatian words whose phonetics is partly influenced by
Venetian as borrowings. Examples of such words include nwat night or pyat breast with -t
instead of -pt in the place of Latin -ct-.
Bartolis transcription is transliterated as follows:
Bartoli
MosLex
b
b
p
p
v
v
f
f
d
d
t
t
ż
ʒ
z
c
z
s
s
ǧ
ǯ
č
č
g
g
k
k
m
m
n
n
ń
ɲ
ŋ
r
r
l
l
ľ
ʎ
u
w
j, i
y
i
i
e
e
a
a
o
o
u
u
Note: there is also an allophone of /u/, transcribed by Bartoli as u
.
Friulian
The Friulian language is divided into three dialectal groups: Central-Oriental, Western and Carnian.
Unfortunately, we managed to collect the wordlist only for the common Friulian koine, based on the
Central-Oriental dialect.
The following transliterational chart covers our principal sources:
Pirona
Modern Orthography
MosLex
b
b
b
p
p
p
v
v
v
f
f
f
d
d
d
t
t
t
z
ʒ
z
z
c
s between two vowels
s between two vowels
z
s, ss between two
vowels
s, ss between two vowels
s
z
z
ǯ
ç, c before e and i
ç, c before e and i
č
gi
gj
ȡ
çh
cj
ȶ
g, gh before e and i
g, gh before e and i
g
c, ch before e and i
c, ch before e and i
k
m
m
m
n
n
n
gn
gn
ɲ
n
n
ŋ
r
r
r
l
l
l
u
u
w
j
j
y
i
i
i
é, e
e
e
è
e
ɛ
a
a
a
ó, o
o
o
ò
o
ɔ
u
u
u
V
V
V
, V
V
ˈV
Notes:
1. We use the transcriptional signs ȡ and ȶ for the Friulian sounds designated in modern orthography
as gj and cj, but in fact these sounds are post-palatal/prevelar, not palatal [Narumov & Sukhachev
2001: 371].
2. The short high vowels i and u are normally realized as ɪ and ʊ respectively.
Ladin
Ladin is a language with a high degree of dialectal diversity. Unfortunately, we were unable to
collect full lists for all the dialects; complete lists are available only for Gardenese and Fassano
dialects. However, the notes section on the Gardenese dialect contains partial data on the
corresponding equivalents for the Marebbano, Badiotto, Livinallese, and Nones dialects.
For lexicostatistical discrepancies between the Gardenese and Fassano lists see: black, hear,
leaf, lie, man, tree, woman, snake.
The following transliterational chart covers our principal sources:
Gartner
DILF
Forni
MosLex
b
b
b
b
p
p
p
p
v
v
v
v
f
f
f
f
d
d
d
d
t
t
t
t
ts
z
z
c
z
s between two vowels; ś
in the very beginning of
the word
s between two vowels
z
s
s, ss between two
vowels
s, ss between two
vowels
s
dy
g before e, i
g before e, ë, i
ǯ
c before e, i or at the end
of a word
c before e, ë, i or at the
end of a word
č
ž
j
j
ž
š
sc before e, i or at the
end of a word
sc before e, ë, i or at the
end of a word
š
t
c before e, i
c before e, ë, i
ɕ
g
g
g
g
k
c
c
k
m
m
m
m
n
n
n, -nn
n
ny
gn
ɲ
ŋ
n
n
ŋ
r
r
r
r
l
l
l
l
u
u
w
i
i
i
y
i
i
i
i
e
e
e
e
ę
è, e
o
ɛ
a
a
a
a
a
, ə
-
ë, e
ə
o
o
o
o
ǫ
ò, o
o
ɔ
u
u
u
u
ʋ
-
-
ü
-
-
ˈV
Romansh
The Romansh language has five main dialects: Sursilvan, Sutsilvan, Surmiran, Puter, Vallader, and
the literary language (Rumantsch Grischun). Unfortunately, we had no possibility to collect the lists
for Sutsilvan and Puter.
For the discrepancies between the lists see: burn, eat, feather, hear, kill, mountain, not,
person, that, tree, woman, yellow, near.
Romansh orthography is transliterated as follows:
Romansh orthography
MosLex
b
b
p
p
v
v
f
f
d
d
t
t
z
c
s between vowels
z
s; ss between vowels
s
g before e and i
ǯ
tsch
č
sch; s before a voiced consonant
ž
sch; s before a voiceless consonant
š
tg
ȶ
ch, tg (Grischun)
ɕ
g
g
c
k
m
m
n
n
gn
ɲ
ng
ŋ
r
r
l
l
gl
ʎ
u
w
j, i
y
i
i
e
e
e, è
ɛ
u
ʊ
a
a
o
o
o
ɔ
u
u
-, V
ˈV
Piemontese
The Piemontese language has two main dialects: High Piemontese (divided into Turinese, Canavese,
Biellese, Langarolo, Monferrino and other dialects) and Low Piemontese [Chelysheva 2001: 95].
Unfortunately, we were unable to collect lists for all these dialects. We only present the lists for the
idioms of Lanzo Torinese, Barbania and Carmagnola (close subdialects of the Turinese dialect) and
the Vercellese dialect, which is sometimes claimed as transitional between Piemontese and Lombard
below.
For discrepancies between the lists see bark, big, breast, many, swim, tree, what,
woman, heavy, wind.
Piemontese orthography is transliterated as follows:
Piemontese orthography
MosLex
b
b
p
p
v
v
f
f
d
d
t
t
z, s between two vowels
z
s
s
g before e, i
ǯ
c before e, i
č
g; -gh
g
c; -ch
k
m
m
n
n
gn
ɲ
n; n-
ŋ
r
r
l
l
u
w
j, i
y
i
i
u
ü
é
e
è
ɛ
eu
ö
ë
ə
a
a
ó
o
ò
ɔ
o
u
V
, V
ˈV
Note: Mr. Noris gave all the forms in the IPA transcription, so we do not discuss the forms in
Piemontese orthography for the Vercellese dialect.
Lombard
The Lombard language is divided into many dialects. Unfortunately, we had no possibility to collect
the full list for all the dialects, so we only present the lists for the Bergamo dialect and the dialect of
Plesio below.
For discrepancies between the lists see bark, big, bite, dry, kull, stone, woman, heavy,
snake, thin.
The Lombard orthography is transliterated as follows:
Lombard orthography
MosLex
b
b
p
p
v
v
f
f
d
d
t
t
z
c
z
z
s
s
g before e, i
ǯ
c before e, i or on the end
of word
č
sc before e, i
š
g
g
c
k
m
m
n
n
gn
ɲ
n before consonants
ŋ
r
r
l
l
CC
u before vowels
w
i
y
i
i
ü
ü
e, é
e
e, è
ɛ
a
a
o, ó
o
ò
ɔ
u
u
V
, V
ˈV
Note: In word-final position, consonants become devoiced.
Emiliano-Romagnol
Emiliano-Romagnol is divided into Emilian and Romagnol dialect groups, with many dialects and
subdialects in every group.
For discrepancies between the lists see bark, big, bite, breast, cold, feather, head, kill,
lie, many, not, person, rain, stone, swim, what, thin, worm.
The following transliterational chart covers our principal sources:
Ercolani
Modern Orthography
MosLex
b
b
b
p
p
p
v
v
v
f
f
f
d
d
d
t
t
t
z
ʒ
z
c
ş
ś
z
s
s
s
g before e, i
g before e, i
ǯ
c before e, i; cc on the
end of the word
c before e, i; ć
(Carpigiano); ç
(Reggiano)
č
g
g
g
c
c
k
m
m
m
n
n
n
gn
gn
ɲ
n
n
ŋ
r
r
r
l
l
l
j
i
y
i
i
i
e
é
e
ē
ē
ê
-
eə
é
-
è
è
ɛ
ë
-
ɛə
a
a
a
â
-
ã
ó
ó
o
ô
ou
ow
ò
ò
ɔ
ö
-
ɔə
u
u
u
-
V
Ligurian
The Ligurian language is divided into four dialects: Intemelio, Western, Genoese, and Cinque Terre.
Unfortunately, we had no possibility to collect the full list for all the dialects, so we only present the
lists for the Genoese dialect, the Rapallo subdialect, and the dialect of Stella below.
For discrepancies between the lists see breast, feather, lie, many, mountain, person, rain,
sand, warm, snake.
The Ligurian orthographies are transliterated as followsː
Ligurian orthographies
MosLex
b
b
p
p
v
v
f
f
d
d
t
t
z
ʒ
z
c
s
z
s, ç
s
g before e, i, æ
ǯ
c before e, i, æ
č
x
ž
sci
š
g
g
c
k
m
m
n
n
gn
ɲ
-n
ŋ
r
r
l
l
u, ou before vowels
w
ü before vowels
ɥ
j, i
y
CC
i
i
ü
ü
é
e
ê
è
ɛ
eu; ö
ö
æ
ä
a
a
o
o
ô
ò
ɔ
ö
ɔː
u; ǒ
u
V
Note: Mr. Fasce gave all the forms in the IPA transcription, so we do not discuss the forms in
Ligurian orthography for the Rapallo dialect.
Venetian
The Venetian language is divided into Central Venetian (Padua, Vicenza, Polesine), North-Eastern
Venetian (Treviso, Feltre, Belluno), Western Venetian (Verona), Trentinian, Venetian Proper
(Venice) and different "colonial" dialects [Chelysheva 2001: 116]. Although we have lists for
different dialects, most of them dont demonstrate significant differences in the 110-item list. The
dialects, which really stand out, are the Bellunese one and the Primiero one (the subdialect of
Bellunese). So we present three Venetian lists below: common Venetian, Bellunese and Primiero.
The Venetian orthography is transliterated as follows:
Venetian orthographies
MosLex
b
b
p
p
v
v
f
f
d
d
t
t
đ (Primiero dialect)
ð
(Primiero dialect)
θ
z
c
x; z
z
s; ç
s
g before e, i
ǯ
c before e, i
č
g; gh before e, i
g
c; ch before e, i
k
m
m
n
n
gn
ɲ
n on the end of a syllable
or a word
ŋ
r
r
l
l
u
w
i
y
ł
ɰ; e
i
i
e; é
e
è
ɛ
a
a
o; ó
o
ò
ɔ
u
u
V
, V
ˈV
Note 1: the letters {x} and {z} are used for the sound z in different variants of the Venetian
orthography. Some informants prefer the first (more traditional) variant, some prefer the second
(more similar to the Italian orthography).
Note 2: the pronunciation of {ł} can vary depending on dialect.
Italian
We present the lists for Dantes Italian, Standard Italian, the Tuscan dialect of Grosseto and the
Umbrian dialect of Foligno.
Dantes Italian wordlist is based on Dante Alighieris works. Dante (1265-1321, born in Florence) is
considered the father of the literary Italian language. The main source for the wordlist was the
Divine Comedy (1306-1321), but in some difficult cases we have also consulted Dantes minor
works written in Italian (Vita nuova, Convivio, Il Fiore, Rime, Detto dAmore). For the examples,
we have chosen to use Longfellows translation of Divine Comedy which is closest to the original,
being a nearly literal translation. In the most ambiguous cases we resort to the aid of Petrarchs
(1304-1374, born in Arezzo) poems and Boccaccios (1313-1375, born in Florence) prose.
All of Dantes works are available online in Italian and English at Princeton Dante Project:
http://etcweb.princeton.edu/dante/pdp
Longfellows translation of the Divine Comedy may be found at Project Gutenberg:
http://www.gutenberg.org/cache/epub/1004/pg1004.html
Abbreviations: Inf. = Inferno; Purg. = Purgatorio; Par. = Paradiso; VN = Vita nuova; Conv. =
Convivio; Dec. = Decameron.
The Italian orthography is transliterated as follows:
Italian orthography
MosLex
b
b
p
p
v
v
f
f
d
d
t
t
z
ʒ
z
c
s
z
s
s
g before e, i
ǯ
c before e, i
č
sc before e, i
š
g
g
c
k
m
m
n
n
gn
ɲ
n before g,c
ŋ
r
r
l
l
gl
ʎ
CC
u
w
i
y
i
i
e
e
e
ɛ
a
a
o
o
o
ɔ
u
u
Neapolitan
Unlike the lects of Northern Italy, Neapolitan dialects do not feature as many discrepancies between
them in the Swadesh list. For this reason, we prepared only one list for Neapolitan.
The Neapolitan orthography is transliterated as follows:
Neapolitan orthography
MosLex
b
b
p
p
v between two vowels
β
v
v
f
f
d
d
t
t
d between two vowels
ɾ
z
c
s
s
g before e, i
ǯ
c before e, i
š
cc before e, i
čː
g
g
c
k
g between two vowels
ɣ
m
m
n
n
gn
ɲ
n before g, c
ŋ
r
r
l
l
gl
ʎ
CC
u
w
j
y
i
i
e
e, ɛ
unstressed e, o;
unstressed a on the end of
a word
ə
a
a
o
o, ɔ
u
u
V
, V
ˈV
Although the Neapolitan orthography does not make any distinctions between open and closed
vowels, the relevant information was provided to us by the informants.
Logudorese
There is no consensus as to the dialectal division of Logudorese [Narumov 2001: 161]. According to
the Glottolog scheme, Logudorese Sardinian is divided into Barbaricino, Northwestern Logudorese
and Southwestern Logudorese. Unfortunately, we had no possibility to collect the full lists for all the
dialects, so only the list for the Barbaricino dialect is provided.
The Logudorese orthography is transliterated as follows:
Logudorese orthography
MosLex
b
b
p
p
v
v
f
f
d
d
t
t
th
θ
dz
ʒ
z
c
s
s
dd
ɖː
j
ž
g
g
c
k
m
m
n
n
r
r
l
l
i
y
CC
i
i
e
e
a
a
o
o
u
u
V
ˈV
Campidanese
Campidanese Sardinian is divided into Western Campidanese, Sub-Barbaricino, Ogliastrino, Central
Campidanese, Sarrabense, Sulcitano, and Cagliari dialects (South Campidanese) [Narumov 2001:
161]. Unfortunately, we had no possibility to collect full lists for all the dialects. The list for the
Cagliari dialect serves as our primary object of comparison; additionally, the notes section also
include data on the Domus de Maria subdialect.
The Campidanese orthography is transliterated as follows:
Campidanese
orthography
MosLex
b
b
p
p
b
β
v
v
f
f
d
d
t
t
d
ð
z
c
s
s
g before e, i
ž, ǯ
c before e, i
č
x
ž
sc
š
g
g
c
k
g
ɣ
m
m
m
n
r
r
l
l
CC
u
w
i
y
i
i
e
e, ɛ
a
a
o
o, ɔ
u
u
-
ˈV
Although the Campidanese orthography does not make any distinctions between open and closed
vowels, they do exist; the information was provided by Mr. Pintus and Mrs. Fadda.
Sicilian
Below we present the lists for Palermitan, Messinese, Catanian and South-Eastern (comprises the
lists for the dialects of Ragusa and Agrigento) dialects, which slightly differ from each over, see
big, burn, feather, lie, man, rain, small, tree, warm, snake.
Although Buscemi is geographically closer to Ragusa and Agrigente, the Buscemese list was added
to the Palermitan comments due to its close resemblance (only three items differ: man, rain and
tree).
The Sicilian orthography is transliterated as follows:
Sicilian orthography
MosLex
b
b
p
p
v
v
f
f
d
d
t
t
z
c
s
s
dd
ɖː
dr
ɖr
tr
ʈɽ
rr
ʐ; rː (depending on dialect)
str
ʂɽ
g before e, i
ǯ
c before e, i
č
sc before e, i
š
g
g
c
k
m
m
n
n
gn
ɲ
n before c, g
ŋ
r
ɾ
l
l
CC
u before a vowel
w
i before a vowel
y
i
i
i (unstressed and non-
initial)
ɨ (Messinese dialect)
e
ɛ
a
a
o
ɔ
u
u
V
ˈV
Catalan
The Catalan language is divided into two major groups of dialects: eastern (Northern, Central,
Balearic and Algherese dialects) and western (North-Western and Valencian dialects). The Maon
Minorcan subdialect belongs to the Balearic dialect, the subdialects of Castelló de la Plana, Valencia
and Manises belong to the Valencian dialect. The subdialects of Les Garrigues and Oliana belong to
the North-Western dialect.
Catalan orthography is transliterated as follows:
Catalan orthography (Central Catalan
pronunciation)
MosLex
v, b in the beginning of a word or after m
b
v, b
β
p
p
f
f
d in the beginning of a word or after n, m, l
d
d
ð
t
t
s between two vowels or before a voiced
consonant
z
s; ss between two vowels; ç; c before e, i
s
tg
ǯ
tx
č
j
ž
x
š
g in the beginning of a word or after ŋ, m
g
g
ɣ
c
k
m
m
n
n
ny
ɲ
n
ŋ
r
ɾ
rr, r at the beginning of a word
r
l
ɫ
ll
ʎ
u
w
i
y
i
i
e
e
e
ɛ
unstressed e and a
ə
a
a
o
o
o
ɔ
u, unstressed o
u
-
ˈV
Note: In word-final position, consonants become devoiced.
The transcription for Central Catalan, Minorcan and Les Garrigues dialects was made by Mr. Alòs i
Font, Mr. Cardona and Mr. Montagut respectively.
Spanish
Spanish orthography is transliterated as follows:
Spanish orthography
MosLex
b, v in the beginning of a word or
after n, m
b
p
p
b
β
f
f
d in the beginning of a word or after
n, l
d
t
t
d
ð
c before e, i; z
θ
s before a voiced consonant
z
s
s
ch
č
g in the beginning of a word or after
n, but not before e, i; gu before e, i
g
qu before e, i; c
k
g
ɣ
j; g before e, i
x
m
m
n
n
ñ
ɲ
n before c, g
ŋ
rr or r in the beginning of a word
r
r
ɾ
l
l
ll
ʎ
y
ʑ
u, ü
w
i
y
i
i
e
e
a
a
o
o
u
u
V
ˈV
Asturian
The Asturian language uses Spanish orthography. The main difference is that the letter x is used for
the sound š. Apart from that, there are no sounds β, ð and ɣ.
Standard Portuguese
The pronunciation and semantics are restricted to Portugal Portuguese language (especially in its
Lisboa variant).
Portuguese orthography is transliterated as follows:
Portuguese orthography
MosLex
b
b
p
p
b between two vowels
β
v
v
f
f
d
d
t
t
d between two vowels
ð
z, s between two vowels
z
s; ss; ç
s
j; g before e, i; s before a voiced consonant
ž
ch; z, s in the end of a word or before an
unvoiced consonant
š
g; gu before e, i
g
c; qu before e, i
k
g between two vowels
ɣ
m
m
n
n
nh
ɲ
n before g, c
ŋ
rr; word initially; after l, n
ʁ
r
ɾ
l
l
word finally; before a consonant
ɫ
lh
ʎ
u
w
i
y
i; unstressed e word initially
i
in, im
ĩ
e, é
e
e, ê
ɛ
en, em
unstressed e in the word-final position or
before a consonant
ə
stressed a
a
unstressed a; stressed a before m, n, nh; e
before ɲ, ʎ, ž, š, y
ɐ
an, am
ɐ

ãe, em, ens word finally
ɐ
y

ão, am word finally
ɐ
w
o, ó
o
o, ô
ɔ
on, om
õ
u; unstressed o
u
um
ũ
ui
ũy

-
ˈV
Galician
The Galician orthography is transliterated as follows:
Galician orthography
MosLex
b, v
b
p
p
b, v between two vowels
β
f
f
d
d
t
t
d between two vowels
ð
z; c before e, i
θ
s
s
ch
č
x
š
g
g
c
k
g between two vowels
ɣ
m
m
n
n
ñ
ɲ
nh; n word finally
ŋ
rr, r word initially
r
r
ɾ
l
l
ll
ʎ
eu
ew
ou
ow
ei
ey
oi
oy
i
i
e
e
e
ɛ
a
a
o
o
o
ɔ
u
u
-
ˈV
Occitan
The Provençal dialect belongs to the southern Occitan dialects, besides the Languedocian dialect
[Lobodanov, Morozova, Chelysheva 2001: 280].
Coupiers transcription is transliterated as follows:
Coupier
MosLex
b
b
p
p
v
v
f
f
d
d
t
t
s between two vowels
z
s; ss; ç; c before e, i
s
j
ǯ
ch
č
g
g
c
k
m
m
n
n
gn
ɲ
before g, c; word finally
ŋ
rr; word initaially; after n,
l, s, z
ʁ
r
ɾ
l
l
u
w
i
y
i
i
u
ü
e
e
è
ɛ
a
a
o
o
ò
ɔ
ou
u
V
, V
ˈV
Note: In word-final position, consonants become devoiced.
Franco-Provençal
The following transliterational chart covers our principal sources:
Virets transcription
MosLex
b
b
p
p
v
v
f
f
d
d
t
t
zh
ð
sh
θ
z
z
s; s between two vowels
s
dj
ǯ
j
ž
ch
š
g
g
k
k
m
m
n
n
ny
ɲ
r
r
l
l
ly
ʎ
w
w
i
y
i
i
u
ü
é
e
eu
ö
ê, è
ɛ
eu
œ
a
a
ô
o
o
ɔ
ou
u
ihn
ĩ
in
ɛ
an
ã
on
ɔ
circumflex sign
-
ˈV
French
Here we present the lists for Chrétien de Troyes Old French and Modern Standard French.
Chrétien de Troyes (1130-1191) is the author of five poems: Erec and Enide; Cligès; Yvain, the
Knight of the Lion; Lancelot, the Knight of the Cart; Perceval, the Story of the Grail. Full texts are
available at http://www.atilf.fr/dect. English translation of the first four poems can be found at
http://omacl.org.
Old French words are transcribed according to XII century French phonetics sketch made by E.
Einhorn [Einhorn 1999: 2-13]. The Old French orthography is transliterated as follows:
Old French orthography
MosLex
b
b
p
p
v
v
f
f
d
d
t
t
z; c before e, i
c
s between two vowels
z
s; ss between two vowels
s
j; g before e, i
ǯ
ch
č
g, gu before e, i
g
c; qu before e, i
k
m
m
n
n
gn; ng word finally
ɲ
n before c, g
ŋ
r
r
l
l
il, ill
ʎ
u after vowel
w
u before vowel
ɥ
i
y
i
i
u
ü
e
e, eː
e
ɛ
e
ə
a
a
o
o, oː
o
ɔ
ai
ey
oi
oy
en
ãn
-
ˈV
Note: vowels before m and n become nasalized.
Standard French orthography is transliterated as follows:
French orthography
MosLex
b
b
p
p
v
v
f
f
d
d
t
t
z, s between two vowels
z
s; ss; ç
s
j; g before e, i
ž
ch
š
g; gu before e, i
g
c; qu before e, i
k
m
m
n
n
gn
ɲ
n before g, c
ŋ
r
ʁ
l
l
u before a voweil
ɥ
i, ill
y
i
i
u, û
ü
e, é
e
e, ê, ai
ɛ
in, im, ein not before a
vowel
ɛ
eu, œu
œ, ö
un
œ
unstressed e in some
positions
ə
a, à
a, ɑ
en, em, an, am
ɑ
o, ó, au, eau
o
o, ô
ɔ
on, om
ɔ
ou
u
oi
wa
Picard
The Picard words are written in the Feller-Carton orthography, which is based on that of Standard
French. The main discrepancies are: pronunciation of en as ɛ; use of the letter k before e and i for the
sound k; use of the trigraph tch for č; use of for the diphthong we.
Walloon
The Walloon orthographies are transliterated as follows:
Transinne (Feller orthography)
Rifondou
MosLex
b
b
b
p
p
p
v
v
v
f
f
f
d
d
d
t
t
t
z
z
z
s; з; ss between two vowels; c
before e, i
s; з; ss between two
vowels; c before e, i
s
sc
sch
sk
dj, g before e, i
dj, g before e, i
ǯ
tch
tch
č
j
jh
ž
ch
xh
š
-
xh word finally
ʆ
g
g
g
c, k
c, k
k
m
m
m
n
n
n
gn
gn
ɲ
r
r
ʀ
l
l
l
w
w
w
y
y
y
h
h
h
i
i
i
u
u
ü
é, e
é, e
e
ê
ea
ɛː
eu
eu
ö
è, e
è, e
ɛ
eu
eu
œ
a
a
a
-
е
ɔː
ô, aw
ô
o
o
o
ɔ
ou
ou
u
-
oe
wɛ
aw
е
aw
an, am
an, am
ã
in
in
ɛ
-
én
on
on
ɔ
circumflex sign
circumflex sign
Note 1: Consonants in word-final position become voiceless.
Note 2: Vowels ü and i (in Rifondou only i) are lost in the first syllable if a preceding word ends in a
vowel, cf. {in ptit gamin} "a small boy" and {ene pitite gamine} "a small girl".
References & sources (general)
Beekes 2010 = Beekes R. S. P. Etymological Dictionary of Greek. Leiden, Boston: Brill.
Chelysheva 2001 = Челышева И. И.. Диалекты Италии [Chelysheva, I. I. The dialects of
Italia]. In: Языки мира. Романские языки. М.: Academia, 91-145 // A grammar sketch of
the Italian lects.
Haiman & Benincà 1992 = Haiman J., Benincà P. The Rhaeto-Romance languages. London, New
York: Routledge. // A sketch of Romansh, Ladin and Friulian languages.
Orel 1998 = Orel, Vladimir. Albanian Etymological Dictionary. Leiden/Boston/Köln: Brill. // One of
the latest major reference works on Albanian etymology. Contains important original work
on the reconstruction of Proto-Albanian and its external connections, with references to
etymological research of all of the authors predecessors.
Prati 1951 = Prati A. Vocabolario etimologico italiano. Torino: Tip. Ditta Fratelli Pozzo.
Walde & Hofmann 1938 = Walde A., Hofmann J. B. Lateinisches Etymologisches
Wörterbuch. Heidelberg: Carl Winters Universitätsbuchhandlung.
References & sources (Romance lexicostatistics)
Blažek V. Balkano-románské jazyky: lexikostatistický test. In: Linguistica Brunensia, LX, 12,
2012, s. 6183. // A lexicostatistical classification of Balkan-Romance languages based on
the same principles as in Blažek’s previous article; wordlists for Romanian, Moldavian,
Arumanian, Megleno Romanian, and Istro Romanian are included in the article.
Guţu-Romalo V. Stabilirea datei de separaţie a aromînei de dacoromînă cu ajutorul glotocronologiei.
In: Studii și cercetări lingvistice, 10/4, 1959, pp. 576-584. // First ever publication on
Balkan-Romance lexicostatistics.
Kroeber A. L. Romance history and glottochronology. In: Language, Vol. 34, No 4, 1958, pp. 454-
457. // A response to Reas article.
Novotná P., Blažek V. Rétorománské jazyky – přehled a klasifikace. In: Linguistica Brunensia,
LVII, 2008, s. 15-32. // An attempt at a lexicostatistical classification of the Rheto-Romance
languages, using Sergei Starostins recalibrated glottochronology, but not such a strict
approach to semantics as employed in the MosLex; wordlists for Standard Italian,
Rumantsch Grischun, Friulian, and Dolomito Ladin are included in the article.
Rea J.A. Concerning the Validity of Lexicostatistics In: International Journal of American
Linguistics, Vol. 24, No. 2, 1958, pp. 145-150. // A critical review of the lexicostatistical
method on the basis of eight Romance languages (French, Spanish, Romanian, Catalan,
Italian, Sardinian, Portuguese, Rheto-Romance), wordlists for all of whom are included into
the article.
References & sources (Latin)
All Latin texts are available at the Latin Library: http://www.thelatinlibrary.com.
Quotations from Plautus texts are given in the English translation done by H. T. Riley (available at:
http://www.perseus.tufts.edu/hopper/), with the exception of Vidularia which was translated
by P. Nixon (available at:
http://www.archive.org/stream/plautuswithengli05plauuoft#page/336/mode/2up).
Catos De agri cultura is quoted in W. D. Hoopers and H. B. Ashs translation (available at:
http://penelope.uchicago.edu/Thayer/E/Roman/Texts/Cato/De
Agricultura/home.html).
Quotations from "Metamorphoses" are given in the English translation done by A. S. Kline
(available at: http://www.poetryintranslation.com/klineasapuleius.htm), which is written in a
more modern language than the сlassic translation by W. Adlington (1566). "Apology" was
translated by H. E. Butler (available at:
http://www.chieftainsys.freeserve.co.uk/apuleius
apology01.htm), "Florida" is quoted
according to http://www.attalus.org/translate/florida.html.
Translation of "Attic Nights" (books 1-13) can be found at
http://penelope.uchicago.edu/Thayer/E/Roman/Texts/Gellius/home.html; books 14-20 are
available in an older translation:
https://archive.org/stream/atticnightsaulu02gellgoog#page/n335/mode/2up.
The English translation of "Satyricon" is available at: http://www.gutenberg.org/files/5225/5225-
h/5225-h.htm.
The text of the Bible is quoted according to Contemporary English Version (1999), available at:
http://www.biblija.net/biblija.cgi?l=en.
Kroonen 2013 = Kroonen, G. Etymological Dictionary of Proto-Germanic, Leiden, Boston: Brill.
Lewis & Short 1958 = Lewis, Ch., Short, Ch. A Latin Dictionary, Oxford: Clarendon Press.
Normanskaya 2005 = Норманская, Ю. В. Генезис и развитие системы цветообозначений в
древних индоевропейских языках [Normanskaya Yu. V. The genesis and development of
colour designations system in the ancient Indo-European languages]. М.: Спорт и культура
// A series of etymological sketches on colour names in Indo-European languages, including
Latin.
OLD 1968 = Oxford Latin Dictionary. Oxford: Oxford University Press // A comprehensive
dictionary of the Latin language.
Passera 1998 = Passera, E. The semantic evolution of the late Latin terms domina, femina and mulier
in the Italian language. // Quaderni ditalianistica, XIX, 2, 1998, 105-125.
Solopov 2007 = Солопов, А. И. Цветообозначения в латинском языке [Solopov A. I., Colour
designations in the Latin langugae]. In: Наименования цвета в индоевропейских языках.
М.: КомКнига, 2007, 66-76 // A short description of colour designations in Latin.
De Vaan 2008 = De Vaan, M. Etymological dictionary of Latin and the other Italic languages,
Leiden, Boston: Brill.
References & sources (Megleno Romanian).
Capidan 1925 = Capidan, Th. Meglenoromânii. Istoria și graiul lor. București: Cultura Națională. //
A grammar sketch of the Megleno Romanian language.
Capidan 1928 = Capidan, Th. Meglenoromânii. Literatura populară la meglenoromâni. București:
Cultura Națională. // Megleno Romanian texts (folk songs, fairy tales, riddles and proverbs).
Capidan 1935 = Capidan, Th. Meglenoromânii. Dicționar meglenoromân. București: Cultura
Națională. // Megleno Romanian-Romanian dictionary.
References & sources (Istro Romanian)
Byhan 1899 = Byhan, A. Istrorumänisches Glossar. In: Jahresbericht des Instituts für rumänische
Sprache 6. Leipzig: Johann Ambrosius Barth, S. 174-396. // A concise Istro Romanian-
German dictionary.
Glavina 1905 = Glavina, A. Calindaru lu rumeri din Istrie. București. // Contains concise Istro
Romanian-Romanian vocabulary, which includes the words absent from Romanian or which
phonetically differ from Romanian (as a rule, with rhotacism n > r).
Kovačec 1998 = Kovačec, A. Istrorumunjsko-Hrvatski rječnik (s gramatikom i tekstovima). Pula:
Znanstvena udruga Mediteran. // Istro Romanian-Croatian dictionary.
Kovačec 2010 = Kovačec, A. Istrorumunjsko-Hrvatski rječnik (s gramatikom i tekstovima).
Available at: http://www.vlaski-zejanski.com/Resursi-i-biblioteke/rjecnik. // Electronic
version of the Istro Romanian-Croatian dictionary [Kovačec 1998].
Narumov 2001 = Нарумов, Б. П. Истрорумынский язык/диалект [Narumov B.P. Istro Romanian
language/dialect]. In: Языки мира. Романские языки. М.: Academia, pp. 656-671. // A
grammar sketch of the Istro Romanian language.
Sârbu & Frăţilă 1998 = Sârbu, R.; Frăţilă, V. Dialectul istroromân. Texte și glosar. Timișoara. //
Istro Romanian texts written down primarily at Žejane, and the glossary based on it.
Contains about 2500 entries.
Weigand 1894 = Weigand, G. Istrisches. In: Jahresbericht des Instituts für rumänische Sprache 1.
Leipzig: Johann Ambrosius Barth, S. 122-155. // Istro Romanian texts.
Weigand 1894 = Weigand, G. Nachträge zu vorstehendem Glossar. In: Jahresbericht des Instituts
für rumänische Sprache 6. Leipzig: Johann Ambrosius Barth, S. 397-398. // Some remarks on
Byhans glossary.
Texts recorded in the framework of the project Očuvanje vlaškog i žejanskog jezika / Očuvęj vlåška
și žejånska limba. Available at: http://www.vlaski-zejanski.com. // A database of Istro
Romanian texts with original audios and translations into English and Croatian.
References & sources (Aromanian)
Bara et al. 2005 = Бара, М.; Каль, Т.; Соболев, А.Н. Южноаромынский говор села Турья
(Пинд). Синтаксис, лексика, этнолингвистика, тексты. [Bara M., Kahl T., Sobolev
A.N. A southern Aromanian dialect of Turia village (Pind). Syntax, vocabulary,
ethnolinguistics, texts.] München. // Materials of the 2002 expedition to the village where
one of the southern dialects is spoke.
Capidan 1932 = Capidan, Th. Aromânii. Dialectul aromân. Studiu lingvistic. București. // Grammar
sketch of the Aromanian language.
Cunia 2010 = Cunia, T. Dictsiunar a limbãljelei armãneascã. Available at:
https://archive.org/details/DictsiunarArmanescuDec2008. // Large Aromanian explanatory
dictionary, based mainly on Papahagis dictionary.
Dalametra 1906 = Dalametra, I. Dicţionar macedo-român. București. // Concise Aromanian-
Romanian dictionary.
Gołąb 1984 = Gołąb, Z. The Aromanian dialect of Kruševo in SR Macedonia SFR Yugoslavia.
Skopje. // Brief Aromanian-English glossary, collected by Z. Gołąb in 1968 in Kruševo,
where the Gramustean dialect is spoken.
Papahagi 1963 = Papahagi, T. Dicţionarul dialectului aromîn general și etimologic. București. //
Comprehensive Aromanian-Romanian / French dictionary.
References & sources (Romanian)
Andronescu 1964 = Andronescu Ș. Dicționar romîn-englez. New York: Saphrograph. // Romanian-
English dictionary.
Bolocan et al. 1985 = Bolocan, Gh.; Voronţova, T.; Șodolescu-Silvestru, E. Dicţionar rus-român.
București: Editura Știinţifică și Enciclopedică. // Large Russian-Romanian dictionary.
Borsh & Zaporozhan 1990 = Борш, А.; Запорожан, И. Дикционар рус-молдовенеск [Borsh, A.,
Zaporozhan, I. Russian-Moldavian dictionary]. Кишинэу: Редакция принчипалэ а
енчиклопедией советиче молдовенешть. // Large Russian-Moldavian dictionary (about
30 000 words).
Ciorănescu 2007 = Ciorănescu A. Dicționarul etimologic al limbii române. București: Editura
Saeculum I. O. // Romanian etymological dictionary.
Ciorănescu 2015 = Ciorănescu A. Dicționarul etimologic al limbii române. Available at
http://www.dex.ro. // Electronic version of the Romanian etymological dictionary
[Ciorănescu 2007].
DEaLR 2012 = Dicționarul explicativ al limbii române. București: Univers Enciclopedic Gold. //
Romanian explanatory dictionary.
DEaLR 2015 = Dicționarul explicativ al limbii române. Available at http://www.dex.ro. //
Electronic version of the Romanian explanatory dictionary [DEaLR 2012].
DER 2004 = Dicționar englez-român. București: Univers Enciclopedic. // Comprehensive English-
Romanian dictionary.
Gancz 2015 = Information provided by Mr. Andrei Gancz, who was so kind as to fill the
questionnaire and comment on some sophisticated cases.
Lukht & Narumov 2001 = Лухт, Л. И.; Нарумов, Б. П. Румынский язык [Lukht, L. I, Narumov, B.
P. Romanian language]. // Языки мира. Романские языки. М.: Academia, pp. 574-636. // A
grammar sketch of the Romanian language.
Podiko 1973 = Подико, М. В. Русско-молдавский словарь [Podiko, M. V. Russian-Moldavian
dictionary]. Кишинев: Картя Молдовеняскэ. // Large Russian-Moldavian dictionary (about
35 000 words).
References & sources (Dalmatian)
Bartoli 2002 = Bartoli, M. G. Il Dalmatico: Resti di unantica lingua romanza parlata da Veglia a
Ragusa e sua collocazione nella Romània appennino-balcanica. Roma: Instituto della
Enciclopedia Italiana. // The main source of our knowledge about Dalmatian. Comprises not
only Bartolis own notes, but also notes made by other scholars of Dalmatian (G. Cubich, A.
Zonca, P. Petris, A. Adelmann, M. Celebrini, A. Ive and T. Burada) and data from ancient
Dalmatian manuscripts.
Repina & Narumov 2001 = Репина Т. А.; Нарумов, Б. П. Далматинский язык [Lukht, L. I,
Narumov, B. P. Romanian language]. In: Языки мира. Романские языки. М.: Academia,
pp. 681-694. // A grammar sketch of the Dalmatian language.
References & sources (Friulian)
Decorte 2015 and Virili 2015 = Information kindly provided by Mr. Andrea Decorte (Palmanova)
and Mrs. Astrid Virili (Udine), who filled the questionnaire and gave comments where
necessary.
Narumov & Sukhachev 2001 = Нарумов, Б. П.; Сухачёв, Н. Л. Фриульский язык [Narumov, B.
P., Sukhachev, N. L. Friulian language]. In: Языки мира. Романские языки. М.: Academia,
pp. 365-391. // A grammar sketch of the Friulian language.
Pirona 1871 = Pirona V. Vocabolario friulano. Venezia: Stabilimento Antonelli. // Large Friulian-
Italian dictionary.
References & sources (Ladin)
DILF 2001 = Dizionario Italiano Ladino Fassano. Vigo di Fassa. // Large Italian-Fassano and
Fassano-Italian dictionary.
Forni 2013 = Forni, M. Dizionario Italiano-Ladino Gardenese. Dizioner Ladin de Gherdëina-
Talian. San Martino in Badia: Istitut Ladin Micurà de Rü. // Comprehensive Italian-
Gardenese and Gardenese-Italian dictionary.
Forni 2015 = Forni, M. Dizionario Italiano-Ladino Gardenese. Dizioner Ladin de Gherdëina-
Talian. Available at: http://forniita.ladinternet.it. // Electronic version of the Italian-
Gardenese and Gardenese-Italian dictionary [Forni 2013].
Gartner 1923 = Gartner, Th. Ladinische Wörter aus den Dolomitentälern. Halle: Verlag von Max
Niemeyer. // Gardenese-German and German-Gardenese dictionary with some additional
information on Marebbano, Badiotto, Livinallese and Nones dialects.
Narumov 2001 = Нарумов, Б. П. Ладинский язык [Narumov, B. P. Ladin language]. In: Языки
мира. Романские языки. М.: Academia, pp. 392-410. // A grammar sketch of the Ladin
language.
Pleteršnik 2014 = Pleteršnik M. Slovensko-nemški slovar, Ljubljana: Založba ZRC. Available at:
http://www.fran.si/136/maks-pletersnik-slovensko-nemski-slovar // Slovene-German
dictionary.
References & sources (Romansh)
Schmid 2015, Conrad 2015, Cadruvi 2015, Vital 2015 = The majority of data was provided by Mr.
Lino Schmid (Rumantsch Grischun and Surmiran dialect), Mrs. Maria Cadruvi (Sursilvan
dialect), Mr. Giancarlo Conrad and Mrs. Bettina Vital (Vallader dialect), and who kindly
agreed to fill in the questionnaire and gave some further comments on Romansh.
Decurtins 2001 = Decurtins, A. Niev vocabulari romontsch: surslivan-tudestg. Chur: Legat Anton
Cadonau. // Comprehensive German-Sursilvan and Sursilvan-German dictionary.
Decurtins 2015 = Decurtins, A. Niev vocabulari romontsch: surslivan-tudestg. Available at:
ttp://www.vocabularisursilvan.ch // Electronic version of the German-Sursilvan and
Sursilvan-German dictionary [Decurtins 2001].
Pallioppi & Pallioppi 1895 = Pallioppi, Z.; Pallioppi, E. Dizionari dels idioms romauntschs
dEngiadin ota e bassa, della Val Mstair, da Bravuogen e Filisur. Samedan: Stamperia da
Simon Tanner. // Large Romansh-German dictionary (Puter and Vallader dialects).
Sukhachev & Gorenko 2001 = Сухачёв, Н. Л.; Горенко, Г. М. Ретороманский язык [Sukhachev,
N. L, Gorenko, B. P. Rhaeto-Romance language]. In: Языки мира. Романские
языки. М.: Academia, pp. 335-391. // A grammar sketch of the Romansh language.
There are also on-line dictionaries of Romansh (http://www.pledari.ch/ ;
http://www.udg.ch/dicziunari/vallader/impressum ;
http://www.udg.ch/dicziunari/puter/impressum), but we did not use them in our work.
References & sources (Piemontese)
Gisolo 2015, Fiandro 2015, 2016, Senero 2015, Davico 2016, Noris 2015 = Most of the information
was provided by Mr. Paolo Gisolo (Lanzo Torinese dialect), Mr. Federico Fiandro (Barbania
dialect), Mr. Giuseppe Senero (Carmagnola dialect), Mr. Loris Davico (Turinese dialect) and
Mr. Maurizio Noris (Vercellese dialect), who were so kind to fill the questionnaire and
comment on some sophisticated cases.
Brero 1976 = Brero, C. Vocabolario Italiano-Piemontese. Torino: Editrice Piemonte in Bancarella.
// Short Italian-Piemontese dictionary.
There are also on-line and self-made dictionaries of Piemontese
(http://www.piemunteis.it/dep/dizionario.dep ; ttp://www.piemonteis.com ;
http://www.piemondo.it/dizionario-della-lingua-tedesca-e-piemontese.html), but we did not
use them in our work.
References & sources (Lombard)
Selva 2015, Garlini 2015 = Most of the information was provided by Mr. Bruno Garlini (Bergamo
dialect) and Mr. Franco Selva (Plesio dialect), who were so kind to fill the questionnaire and
comment on some complicated cases.
References & sources (Emiliano-Romagnol)
Ercolani 1960 = Ercolani L. Vocabolario romagnolo-italiano. Monte di Ravenna. // Dictionary of
the Ravennate dialect (Romagnol).
Piacentini 2015, Sacchi 2015, Chertein 2015, Pucci 2015 = The majority of information was kindly
provided by Mr. Diego Piacentini (Ferrarese dialect, Emiliano), Mrs. Gloria Sacchi
(Carpigiano dialect, Emiliano), Mr. Gabriele Chertein (Reggiano dialect, Emiliano) and Mr.
Alessandro Pucci (Marchigiano dialect, Romagnol), who filled the questionnaire and gave
comments where necessary. Unfortunately Mr. Puccis idiolect is highly influenced by
literary Italian, so we dont have enough material to make a separate Marchigiano list and
have to give only some Marchigiano forms in the comments to the Ravennate dialect.
References & sources (Ligurian)
Fasce 2015, Parodi 2015, Piccone 2015 = All the data were provided by Mr. Maurizio Fasce, a
native speaker of the Rapallo subdialect, which is very close to the Genoese dialect, Mr.
Piero Parodi, a native speaker of the Genoese dialect, and Mr. Antonello Piccone, a native
speaker of the Stella dialect (Province of Savona), who kindly agreed to fill in the
questionnaire and gave some further comments on Ligurian.
References & sources (Venetian)
Tosi 2015; Gasparini 2015; Tre 2015; Melon 2015; Busato 2015; Poletto 2015; Cortiana 2015;
Ricchieri 2015; Clementi 2015; Pezzin 2015; Zanetti 2015; Gaio 2015; Caneve 2015 = The
data provided by Mr. Leonardo Tosi, Mr. Germano Gasparini (the dialect of Venice), Mr.
Massimo Tre (the dialect of Padua), Mr. Mauro Melon (the dialect of Este), Mr. Tiziano
Busato (the dialect of Treviso), Mrs. Katia Poletto (the dialect of Salgareda), Mr. Alessio
Cortiana, Mr. Silvio Ricchieri (the dialect of Vicenza), Mr. Moreno Clementi (the dialect of
Schio), Mr. Daniele Serena (the dialect of Arzignano), Mr. Paolo Pezzin (the dialect of
Marostica), Mr. Cristian Zanetti (the dialect of Verona), Mr. Paolo Gaio (the dialect of
Primiero), Mr. Marco Caneve (the dialect of Belluno), who kindly agreed to fill in the
questionnaire and gave some further comments on Venetian.
References & sources (Italian)
Battaglia = Battaglia S. Grande dizionario della lingua italiana. Torino: Union Tipografico-Editrice
Torinese, 1980-2002. // Comprehensive Italian explanatory dictionary in 21 volumes.
Chelysheva & Cherdantseva 2001 = Челышева И. И.; Черданцева, Т. З. Итальянский
язык [Chelysheva, I. I, Cherdantseva, T. Z. Italian language]. In: Языки мира. Романские
языки. М.: Academia, pp. 56-90. // A grammar sketch of the Italian language.
Cherdantseva 2005 = Черданцева, Т. З. Краткий грамматический справочник по итальянскому
языку [Cherdantseva, T. Z. A brief manual of Italian grammar]. Москва: ЧеРо. // A
grammar sketch of the Italian language.
Marcelli 2015; Monti Barnocchi-Moscati 2016 = Information provided by Mr. Massimiliano
Marcelli (the Tuscan dialect of Grosseto) and Mr. Mr. Edmondo David Monti Barnocchi-
Moscati (the Umbrian dialect of Foligno) who were so kind to fill the questionnaire and
comment on some complicated cases.
Michel 2011 = Michel A. Einführung in die italienische Sprachwissenschaft. Berlin, New York: De
Gruyter // A study on Italian linguistics
Passerini Tosi 1989 = Passerini Tosi C. Dizionario italiano-inglese, english-italian. Torino: Paravia.
// Comprehensive Italian-English and English-Italian dictionary.
Patota 2002 = Lineamenti di grammatica storica dellitaliano. Bologna: il Mulino // A historical
grammar of Italian
TsLJdO 2016 = Tesoro della Lingua Italiana delle Origini. Available at: tlio.ovi.cnr.it/TLIO // The
project of Italian historical dictionary
Vitali 2015 = Comments on some sophisticated cases, provided by Dr. Daniele Vitali.
Zingarelli 1980 = Zingarelli N. Vocabolario della lingua italiana. Bologna: Zanichelli. // Italian
explanatory dictionary.
Zorko et al. 2002 = Зорько Г. Ф., Майзель Б. Н., Скворцова Н. А. Большой итальянско-русский
словарь [Zorko G. F., Mayzel B. N., Skvortsova N. A. Big Italian-Russian dictionary]. М.:
Русский язык. // Comprehensive Italian-Russian dictionary.
References & sources (Neapolitan)
Esposito 2015; Russo 2015; Matarazzo 2015; Cirillo 2015; Nagar 2015; Mancusi 2015; Musella
2015; Fattoruso 2015; Morelli 2015; Cerrone 2015; Alois 2015 = Information, provided by
Mr. Luca Esposito, Mr. Vincenzo Russo, Mr. Roberto Matarazzo, Mr. Enrico Cirillo, Mr.
Riccardo Nagar, Mr. Davide Mancusi, Mr. Dario Musella (all - Naples), Mr. Antonio
Fattoruso (SantAntonio Abate), Mr. Jenni Morelli (Grottaminarda), Mr. Mirko Cerrone
(Salerno), Mr. Francesco Alois (Caserta), who were so kind to fill the questionnaire and
comment on some complicated cases.
References & sources (Logudorese)
Buttu 2015 = Information kindly provided by Mr. Gianfranco Buttu, who filled the questionnaire
and gave comments where necessary.
Narumov 2001 = Нарумов, Б. П. Сардинский язык [Narumov, B. P. Sardinian language].
In: Языки мира. Романские языки. М.: Academia, pp. 160-186. // A grammar sketch of the
Sardinian language.
There is also an on-line dictionary of Logudorese (http://vocabolariocasu.isresardegna.it/index.php),
but we have not used it in our work.
References & sources (Campidanese)
Ballicu 2015; Pintus 2015, Fadda 2015 = Information kindly provided by Mr. Simone Ballicu, Mr.
Marco Pintus (Cagliari dialect) and Mrs. Alessia Fadda (the subdialect of Domus de Maria
(Cagliari dialect)), who filled the questionnaire and gave comments where necessary.
Narumov 2001 = Нарумов, Б. П. Сардинский язык [Narumov, B. P. Sardinian language].
In: Языки мира. Романские языки. М.: Academia, pp. 160-186. // A grammar sketch of the
Sardinian language.
References & sources (Sicilian)
Messina 2015, Ornato 2015, La Bua 2015, Salzano 2015, Stornanti 2015, Sorbello 2015, Corsaro
2015, La Mattina 2015, Salerno 2015, Leggio 2015, Miccichè 2015, Coccimiglio 2016 =
Information kindly provided by Mr. Salvatore Messina, Mr. Fulvio Ornato (the dialect of
Palermo), Mr. Salvatore La Bua (the subdialect of Termini Imerese), Mr. Alessio Salzano
(the dialect of Messina), Mr. Vincenzo Stornanti (the subdialect of Spadafora), Mr. Giuseppe
Sorbello (the subdialect of Giarre), Mr. Alessandro Corsaro, Mr. Ferdinando La Mattina, Mr.
Federico Salerno (the dialect of Catania), Mr. Carmelo Leggio (the subdialect of Ragusa),
Mr. Agostino Miccichè (the subdialect of Agrigento) and Mr. Luigi Coccimiglio (the
subdialect of Buscemi) who filled the questionnaire and gave comments where necessary.
References & sources (Catalan)
DCVB 2015 = Diccionari català-valencià-balear. Available at: http://dcvb.iec.cat // Electronic
version of a Catalan explanatory dictionary.
EDCC 1993 = Essencial diccionari castellà-català, català-castellà. Barcelona: Biblograf, 1993. //
Small Spanish-Catalan and Catalan-Spanish dictionary.
Narumov 2001 = Нарумов, Б. П. Каталанский язык [Narumov, B. P. Catalan language]. // Языки
мира. Романские языки. М.: Academia, 2001. P. 492-517. // A grammar sketch of the
Catalan language.
Alòs i Font 2015, Llorens 2015, Cardona 2015, Barreda 2015, Pérez i Sanchis 2015, Montagut 2015,
Balletbò 2015 = The majority of data was provided by Mr. Hèctor Alòs i Font, Mr. Josep
Llorens (Central dialect, Barcelona), Mr. Ciceró Pascual Cardona (Maon Minorcan
subdialect), Mr. David Barreda (a subdialect of Castelló de la Plana), Mr. Guillermo Pérez
Sanchis (a subdialect of Valencia), Mr. Toni Pedrós (a subdialect of Manises) Mr. Abel
Montagut (a subdialect of Les Garrigues) and Mr. Eloi Balletbò (a subdialect of Oliana) who
kindly agreed to fill in the questionnaire and gave some further comments on Catalan.
References & sources (Spanish)
Corominas 1997 = Corominas, J. Breve diccionario etimológico de la lengua castellana. Madrid:
Gredos. // An etymological dictionary of the Spanish language.
Narumov 2001 = Нарумов, Б. П. Испанский язык [Narumov B.P. Spanish language]. In: Языки
мира. Романские языки. М.: Academia, pp. 411-462. // A grammar sketch of the Spanish
language.
Valén 2015 = Information provided by Mr. Antonio Valén, a native speaker of Castilian Spanish (a
subdialect of Soria), who filled the questionnaire and gave some further comments.
References & sources (Asturian)
Narumov 2001 = Нарумов, Б. П. Астурийский язык [Narumov, B. P. Asturian language]. In:
Языки мира. Романские языки. М.: Academia, pp. 535-550. // A grammar sketch of the
Asturian language.
Riego-Delgado 2016, González Rato 2016 = Information kindly provided by Mr. Abraham Riego-
Delgado (Candás, Central dialect) and Mr. Xesús González Rato (Langreo, Central dialect),
who filled the questionnaire and gave some further comments. Unfortunately, neither of the
informants answered all our additional questions, which could have been very useful in some
difficult cases.
References & sources (Standard Portuguese)
Costa & Melo 1975 = Almeida Costa, J., Sampaio e Melo, E. Dicionário da língua portuguesa.
Porto: Porto Editora. // Portuguese explanatory dictionary.
Cândido de Figueiredo = Cândido de Figueiredo, A. Dicionário da língua portuguesa. Amadoro:
Bertrand, 1978 // Portuguese explanatory dictionary in two volumes.
Pimentel Ferreira 2016 = Information provided by Mr. João Pimentel Ferreira (a native speaker of
Lisboa Portuguese), who was so kind to comment on some sophisticated cases.
Feyershtein & Starets 2005 = Феерштейн, Е. Н., Старец, С. М. Большой португальско-русский
словарь [Feyershtein Y. N., Starets S. M. Big Portuguese-Russian dictionary]. М: Живой
язык. // Comprehensive Portuguese -Russian dictionary.
Nascentes 1955 = Nascentes, A. Dicionário etimológico da língua portuguêsa. Rio de Janeiro. //
Etymological dictionary of the Portuguese language.
Nechayeva 2009 = Нечаева, К. К. Португальский язык. Справочник по грамматике [Nechayeva
K. K. Portuguese language. Grammar reference book]. М: Живой язык. // A grammar sketch
of the Portuguese language.
Voinova et al. 1989 = Воинова, Н. Я., Старец, С. М., Верхуша В. М., Здитовецкий А. Г. Русско-
португальский словарь [Voinova N. Y., Starets S. M., Verkhusha V. M., Zditovetskiy A.
G. Russian Portuguese dictionary]. М.: Русский язык. // Comprehensive Russian-
Portuguese dictionary.
Volf 2001 = Вольф, Е. М. Португальский язык [Volf Y. M. Portuguese language]. In: Языки
мира. Романские языки. М.: Academia, pp. 462-491. // A sketch of the Portuguese
language.
CdP 2016 = O Corpus do Português, compiled by M. Davies and M.J. Ferreira. Available at:
http://www.corpusdoportugues.org/x.asp // A Portuguese Corpus.
References & sources (Galician)
Carré Alvarellos 1972 = Carré Alvarellos L. Diccionario galego-castelán e vocavulario castelán-
galego. La Coruna: Moret // Small Galician-Spanish and Spanish-Galician dictionary.
DRAG 2016 = Dicionario da Real Academia Galega. Available at:
http://academia.gal/dicionario#inicio.do // Galician explanatory dictionary.
Fernández Armesto 1981 = Fernández Armesto F. Dicionario castelán-galego. Sada (A Coruña):
Ediciós do Castro // Small Spanish-Galician dictionary.
Franco Grande 1968 = Franco Grande X. L. Diccionario Galego-Castelan e vocabulario Castelan-
Galego. Vigo: Galaxia // Small Galician-Spanish and Spanish-Galician dictionary.
Montoya Bolaños 2015, 2016 = Information provided by Mr. Alejandro Montoya Bolaños (living in
Lugo, native speaker of Central Galician dialect) who kindly agreed to fill in the
questionnaire and gave some further comments on sophisticated cases.
Narumov 2001 = Нарумов, Б. П. Галисийский язык [Narumov B. P. Galician
language]. In: Языки мира. Романские языки. М.: Academia, pp. 517-534. // A grammar
sketch of the Galician language.
References & sources (Occitan)
Coupier 1995 = Coupier J. Dictionnaire Français Provençal. Gignac-la-Nerthe: Association
Dictionnaire Français Provençal. // Comprehensive French-Provençal dictionary.
Lobodanov, Morozova, Chelysheva 2001 = Лободанов, А. П., Морозова, Е. В., Челышева, И. И.
Окситанский язык [Lobodanov A. P., Morozova E. V., Chelysheva I. I. Occitan language].
// Языки мира. Романские языки. М.: Academia, 2001. P. 278-303. // A grammar sketch of
the Occitan language.
References & sources (Franco-Provençal)
Viret 2013 = Viret, R. Dikchonéro Fransé-Savoyв. Dictionnaire Français-Savoyard. //
Comprehensive French- Savoyard Franco-Provençal dictionary. Available at:
http://www.arpitania.eu/aca/documents/Dictionnaire
VireFrancais
Savoyard.pdf
Virets dictionary contains rich material from many Savoyard subdialects. We have chosen the
subdialect of the Albanais region as the main source of data, while giving the forms of other
subdialects in comments in case different terms are used there.
References & sources (French)
Chelysheva 2001 = Челышева, И. И. Старофранцузский язык [Chelysheva, I. I, Old French
language]. In: Языки мира. Романские языки. М.: Academia, pp. 250-278. // A grammar
sketch of the Old French language.
Collins-Robert 1989 = Collins-Robert French-English, English-French dictionary. Paris: Le Robert.
// Comprehensive French-English and English-French dictionary.
Einhorn 1999 = Einhorn, E. Old French. A concise handbook. Cambridge: University Press.
EDCT 2014 = Electronic Dictionary of Chrétien de Troyes, created by Pierre
Kunstmann (University of Ottawa), Hiltrud Gerner (Université de Lorraine) and May
Plouzeau (Université de Provence). Available at
http://www.atilf.fr/dect/lexique/DECEnglis20141201.pdf // A comprehensive dictionary of
Chrétien de Troyes language, including data on frequency of attestation.
Godefroy = Godefroy, F. Dictionnaire de lancienne langue française et tous ses dialects du IXe et
XVe siècle. Genève, Paris: Slatkine, 1982. // An Old French dictionary in 10 volumes.
LGR = Le grand Robert de la langue française. Paris: Le Robert, 1986. // Comprehensive French
explanatory dictionary in 9 volumes.
Lonsdale & Le Bras 2009 = Lonsdale, D., Le Bras, Y. A Frequency Dictionary of French. London,
New York: Routledge. // Short French frequency dictionary, including 5 000 words and
based on 23 millions words corpus.
Rayevskaya 2013 = Раевская, О. В. Новый французско-русский, русско-французский словарь
[Rayevskaya O. V. New French-Russian, Russian-French dictionary]. М.: ABYY Press. //
Medium dictionary (about 40 000 words and expressions).
Referovskaya et al. 2001 = Реферовская, Е. А.; Бокадорова Н. Ю.; Гулыга О. А.; Челышева И.
И. Французский язык [Referovskaya Y. A.; Bokadorova N. Y.; Gulyga O. A.; Chelysheva,
I. I, French language]. In: Языки мира. Романские языки. М.: Academia, pp. 194-249. // A
grammar sketch of the French language.
Robert-Collins 1989 = Robert-Collins dictionnaire français-anglais, anglais-français. Paris: Le
Robert. // Comprehensive French-English and English-French dictionary.
References & sources (Picard)
Leplubo 2016 = Information provided by Mr. Jean-Claude Leplubo, a native speaker of Picard (a
subdialect of Grandvilliers), who filled the questionnaire and gave comments where
necessary.
References & sources (Walloon)
Mahin 2016 = Information provided by Mr. Lucien Mahin, who kindly filled the questionnaire in his
native Transinne Walloon (Southern dialect) and in standardized Rifondou Walloon, and was
so kind to comment on some sophisticated cases.
Authors:
Megleno Romanian: Mikhail Saenko, February 2015. Revised March 2015 (minor corrections).
Revised June 2016.
Istro Romanian, Aromanian: Mikhail Saenko, March 2015. Revised June 2016.
Romanian: Mikhail Saenko, March 2015. Revised April 2016 (some comments on the etymology
are added). Revised June 2016.
Dalmatian: Mikhail Saenko, April 2015. Revised February 2016 (minor corrections). Revised June
2016.
Ladinː Mikhail Saenko, April 2015. Revised June 2016.
Friulianː Mikhail Saenko, April 2015. Revised March 2016 (minor corrections). Revised June
2016.
Romansh, Emiliano-Romagnol: Mikhail Saenko, May 2015. Revised June 2016.
Piemontese: Mikhail Saenko, May 2015. Revised May 2016 (Turinese list added). Revised June
2016.
Ligurian, Italian, Logudorese, Campidanese, Lombard, Neapolitan: Mikhail Saenko, June 2015.
Revised June 2016.
Venetian: Mikhail Saenko, July 2015. Revised February 2016 (minor corrections). Revised June
2016.
Spanish, Occitan: Mikhail Saenko, July 2015. Revised June 2016.
Catalan: Mikhail Saenko, July 2015. Revised February 2016 (minor corrections). Revised June
2016.
Sicilian: Mikhail Saenko, July 2015. Revised March 2016 (minor transliterational corrections).
Revised July 2016 (the Buscemese data were added).
French: Mikhail Saenko, February 2016. Revised June 2016.
Portugueseː Mikhail Saenko, February 2016. Revised March 2016 (some refinements). Revised
June 2016.
Galician, Picard: Mikhail Saenko, March 2016. Revised June 2016.
Foligno Italian, Old Italian: Mikhail Saenko, April 2016. Revised June 2016.
Old French, Walloon, Franco-Provençal: Mikhail Saenko, May 2016. Revised June 2016.
Latin, Asturian: Mikhail Saenko, June 2016.