2021
Incorporating tone in the calculation of phonotactic probability. James Kirby.
Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, Bangkok, 32-38. 2021.
This paper investigates how the ordering of tone relative to the segmental string influences the calculation of phonotactic probability. Trigram and recurrent neural network models were trained on syllable lexicons of four Asian syllable-tone languages (Mandarin, Thai, Vietnamese, and Cantonese) in which tone was treated as a segment occurring in different positions in the string. For trigram models, the optimal permutation interacted with language, while neural network models were relatively unaffected by tone position in all languages. In addition to providing a baseline for future evaluation, these results suggest that phonotactic probability is robust to choices of how tone is ordered with respect to other elements in the syllable.
@inproceedings { kirby2021incorporating,
title = {Incorporating tone in the calculation of phonotactic probability},
booktitle = {Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology},
address = {Bangkok},
author = {James Kirby},
year = {2021},
pages = {32--38}
}
Individuals, communities, and sound change: an introduction. Lauren Hall-Lew, Patrick Honeybone, James Kirby.
Glossa: A journal of general linguistics 6(1), 1-17. 2021.
Do individual differences affect sound change? Traditional approaches to phonetic and phonological change typically downplay differences between the individuals who make up a speech community that is undergoing change, but this has been questioned in recent years in a number of ways from within several distinct traditions of research. The articles in the Glossa Special Collection to which this article is an introduction consider the extent to which individual differences (at a psychological, sociological, physiological, genetic and/or behavioral level) between the members of a speech community might or might not be important in explaining the general properties of sound change. This introduction places these articles in context, considers what we might mean by 'sound change' and 'individual differences', and aims to build a synthesis of the current research landscape in the area.
@article { hall-lew2021individuals,
title = {Individuals, communities, and sound change: an introduction},
author = {Lauren Hall-Lew and Patrick Honeybone and James Kirby},
year = {2021},
pages = {1--17},
journal = {Glossa: a journal of general linguistics},
volume = {6},
number = {1},
doi = {10.5334/gjgl.1630}
}
Relating production and perception of L2 tone. James Kirby, Đinh Lư Giang.
In Ratree Wayland (ed.), Second language speech learning: Theoretical and empirical progress. Cambridge University Press. 2021.
Research on the production/perception relationship in a second language (L2) has focused chiefly on segmental contrasts. In the domain of lexical tone, studies of how production and perception are related at the level of the individual are rare. This paper considers the relation between production and perception of L2 tone in speakers of Kiên Giang Khmer, a non-tonal language, who are also fluent to varying degrees in Southern Vietnamese, a language with 5 lexical tones. In addition to directly comparing L2 to L1 performance in tonal production and perception, we explore how perception might be related to the internal organization of a speaker’s own production system by comparing distances between f0 curves to accuracy in a speeded AX discrimination task. Relative to native speakers of Southern Vietnamese, we found considerable individual variation among speakers of Kiên Giang Khmer with L2 knowledge of Vietnamese in the degree to which they approximated Vietnamese tonal targets. Production accuracy was most strongly related to age, while discrimination performance correlated best with education. In addition, we observed a weak correlation between the acoustic distance of a Khmer speaker’s production of tone T to the native Vietnamese production of T, and the ability to discriminate tone T from other tones. However, speakers who acoustically separated two tones in their own productions were also more accurate at discriminating those tones in perception, regardless of how well those productions approximated native speaker targets.
@InCollection{ kirby2021relating,
booktitle = {Second language speech learning: Theoretical and empirical progress},
address = {Cambridge},
author = {James Kirby and Đinh Lư Giang},
publisher = {Cambridge University Press},
year = {2021},
pages = {249--272},
chapter = {9},
title = {Relating production and perception of L2 tone},
editor = {Ratree Wayland},
doi = {10.1017/9781108886901}
}
Towards a comparative history of tonal text-setting practices in Southeast Asia. James Kirby.
In Reinhard Strohm (ed.), Transcultural music history (pp. 291-312). Berlin: VWB-Verlag. 2021.
In this chapter, I present the beginnings of a systematic investigation into the extent to which different tone languages, and different genres, adhere to principles of alignment between tone and melody. Particular emphasis is placed on attempting to determine the degree to which adherence to principles of text-setting, which govern how a composer may assign notes to words, is a function of a particular linguistic (as opposed to musical) tradition. The question is not simply if there are different text-setting principles active in different languages, which much previous research has shown to be the case. Rather, I seek to determine if there is evidence that principles active in a given language persist across time and genre.
@InCollection{ kirby2021comparative,
booktitle = {Transcultural music history},
address = {Berlin},
author = {James Kirby},
publisher = {VWB-Verlag},
pages = {291--312},
year = {2021},
title = {Towards a comparative history of tonal text-setting practices in Southeast Asia},
editor = {Reinhard Strohm}
}
2020
Prosody across the world: Mainland Southeast Asia. Marc Brunelle, James Kirby, Alexis Michaud, Justin Watkins.
In Carlos Gussenhoven and Aoju Chen (eds.), The Oxford handbook of language prosody (pp. 344-354). Oxford University Press. 2020.
Mainland Southeast Asia is often viewed as a linguistic area where five different language phyla – Austroasiatic, Austronesian, Hmong-Mien, Sino-Tibetan and Kra-Dai – have converged typologically. This chapter illustrates areal features found in their prosodic systems, but also emphasizes their oft-understated diversity.
The first part of the chapter describes word level prosodic properties. A typology of word shapes and stress is first established: we revisit the concept of monosyllabicity, go over the notion of sesquisyllabicity (as typified by languages like Mon or Burmese) and discuss the realization of alternating stress in languages with polysyllabic words (such as Thai and Khmer). Special attention is then paid to tonation. Although many well-known languages of the area have sizeable inventories of complex tone contours, languages with few or no tones are common (20% being atonal). Importantly, the phonetic realization of tone frequently involves more than simply pitch: properties like phonation and duration often play a role in signaling tonal contrasts, along with less expected properties like onset voicing and vowel quality. We also show that complex tone alternations (spreading, neutralization and sandhi processes), although not typical, are well-attested.
The second part of the chapter addresses the less well-understood topic of phrasal prosody: prosodic phrasing and intonation. We reconsider the question of the amount of conventionalized intonation in languages with complex tone paradigms and pervasive final particles. We also show that information structure is often conveyed by means of overt markers and syntactic restructuring, but that it can also be marked by means of intonational strategies.
@InCollection{ brunelle2020mainland,
booktitle = {The Oxford handbook of language prosody},
address = {Oxford},
author = {Marc Brunelle and James Kirby and Alexis Michaud and Justin Watkins},
chapter = {23},
pages = {344--354},
publisher = {Oxford University Press},
year = {2020},
title = {Prosody across the world: Mainland Southeast Asia},
editor = {Carlos Gussenhoven and Aoju Chen}
}
Tone-melody matching in tone language singing. D. Robert Ladd, James Kirby.
In Carlos Gussenhoven and Aoju Chen (eds.), The Oxford handbook of language prosody (pp. 676-687). Oxford University Press. 2020.
Singing in tone languages, a perennial source of mystery to speakers of non-tonal languages, has been the subject of a good deal of research since the turn of the century. This research shows that text-setting constraints are the heart of the solution to respecting both the linguistic and the musical functions of pitch. Specifically, in most of the 15 or 20 Asian and African tone languages where the question has been studied, the most important principle in maintaining intelligibility of song texts seems to be the avoidance of what we call contrary settings: musical pitch movement up or down from one syllable to the next should not be the opposite of the linguistically specified pitch direction. We review the variations on this theme that have been described in the recent literature, including differences between languages and musical genres in how strictly the constraint is observed, and other phonetic resources used to signal tonal distinctions in singing. We briefly consider two more general issues: (1) how tonal text-setting might be incorporated into a general theory that includes traditional European metrics, and (2) what the avoidance of contrary settings tells us about the phonological essence of tonal contrasts.
@InCollection{ ladd2020tone,
booktitle = {The Oxford handbook of language prosody},
address = {Oxford},
author = {D. Robert Ladd and James Kirby},
chapter = {49},
pages = {676--687},
publisher = {Oxford University Press},
year = {2020},
title = {Tone-melody matching in tone language singing},
editor = {Carlos Gussenhoven and Aoju Chen}
}
Transphonologization of voicing in Chru: Studies in production and perception. Marc Brunelle, Tạ Thành Tấn, James Kirby, Đinh Lư Giang.
Laboratory Phonology 11 (1), 15. 2020.
Chru, a Chamic language of south-central Vietnam, has been described as combining contrastive obstruent voicing with incipient registral properties (Fuller, 1977). A production study reveals that obstruent voicing has already become optional and that the voicing contrast has been transphonologized into a register contrast based primarily on vowel height (F1). An identification study shows that perception roughly matches production in that F1 is the main perceptual cue associated with the contrast. Structured variation in production suggests a sound change still in progress: While younger speakers largely rely on vowel height to produce the register contrast, older male speakers maintain a variety of secondary properties, including optional closure voicing. Our results shed light on the initial stages of register formation and challenge the claim that register languages must go through a stage in which breathiness or aspiration is the primary contrastive property (Haudricourt, 1965; Wayland & Jongman, 2002; Thurgood, 2002). This article also complements several recent studies about the transphonologization of voicing in typologically diverse languages (Svantesson & House, 2006; Howe, 2017; Coetzee, Beddor, Shedden, Styler, & Wissing, 2018).
@article{ brunelle2020transphonologization,
author={Marc Brunelle and Tạ Thành Tấn and James Kirby and Đinh Lư Giang},
title={Transphonologization of voicing in Chru: Studies in production and perception},
year={2020},
journal={Laboratory Phonology},
volume={11},
number={1},
pages={15}
}
Toward open data policies in phonetics: what we can gain and how we can avoid pitfalls. Marc Garellek, Matthew Gordon, James Kirby, Wai-Sum Lee, Alexis Michaud, Christine Mooshammer, Oliver Niebuhr, Daniel Recasens, Timo B. Roettger, Adrian Simpson, Kristine M. Yu.
Journal of Speech Science 9, 3-16. 2020.
It is not yet standard practice in phonetics to provide access to audio files along with submissions to journals. This is paradoxical in view of the importance of data for phonetic research: from audio signals to the whole range of data acquired in phonetic experiments. The phonetic sciences stand to gain greatly from data availability: what is at stake is no less than reproducibility and cumulative progress. We will argue that a collective turn to Open Science holds great promise for phonetics. First, simple reflections on why access to primary data matters are recapitulated and proposed as a basis for consensus. Next, possible drawbacks of data availability are addressed. Finally, we argue that data curation and archiving are to be recognized as part of the same activity that results in the publication of research papers, rather than attempting to build a parallel system to incentivize data archiving by itself.
@article{ garellek2020towards,
author={Marc Garellek and Matthew Gordon and James Kirby and Wai-Sum Lee and Alexis Michaud and Christine Mooshammer and Oliver Niebuhr and Daniel Recasens and Timo B. Roettger and Adrian Simpson and Kristine M. Yu},
title={Toward open data policies in phonetics: what we can gain and how we can avoid pitfalls},
year={2020},
journal={Journal of Speech Science},
volume={9},
pages={3--16}
}
Elicitation context does not drive F0 lowering following voiced stops: Evidence from French and Italian. James Kirby, Bob Ladd, Jiayin Gao, Zuzana Elliott.
JASA Express Letters 148(2), EL147-EL152. 2020.
Consonant-intrinsic F0 (CF0) effects are mainly the result of raising F0 following voiceless obstruents, rather than of lowering F0 following voiced obstruents. However, there are also documented instances where lowered F0 following voiced obstruents is enhanced. Given that both voicing and F0 are affected by prosodic context, it is possible that CF0 is lowered in some contexts but not others. This possibility is investigated by examining CF0 in French and Italian in isolated citation forms. Results are comparable to carrier-phrase contexts, where no F0 lowering after voiced obstruents is observed. Possible sources of the apparent cross-linguistic differences are discussed.
@article{ kirby2020elicitation,
author={James Kirby and D. Robert Ladd and Jiayin Gao and Zuzana Elliott},
title={Elicitation context does not drive F0 lowering following voiced stops: Evidence from French and Italian},
year=2020,
journal={JASA Express Letters},
volume={148},
number={2},
pages={EL147--EL152},
doi={10.1121/10.0001698}
}
The role of F0 and phonation cues in Cantonese low tone perception. Yubin Zhang, James Kirby.
JASA Express Letters 148(1), EL40-EL45. 2020.
For languages that primarily exploit F0 to signal tonal contrast, the role of phonation cues in tonal perception remains controversial. This study revisits the use of F0 and phonation cues in Cantonese low tone perception (tone 4, 21/tone 6, 22) using synthesized stimuli. In line with previous studies, F0 contour and height were found to be the most salient cues, with F0 height being more important. The effects of non-modal phonation (creaky and breathy voice) were relatively small. Non-modal phonation enhanced low tone perception only in the low F0 range. The results are consistent with the differential integration hypothesis that the perceptual role of phonation is dependent on F0 and that phonation cues integrate with F0 differently depending on F0 height.
@article{ zhang2020role,
author={Yubin Zhang and James Kirby},
title={The role of {F}0 and phonation cues in {C}antonese low tone perception},
year=2020,
journal={JASA Express Letters},
volume={148},
number={1},
pages={EL40--EL45},
doi={10.1121/10.0001523}
}
Effects of prosodic prominence on obstruent-intrinsic F0 and VOT in German. James Kirby, Felicitas Kleber, Jessica Siddins, Jonathan Harrington.
Proceedings of the 10th International Conference on Speech Prosody, Tokyo, 210-214. 2020.
We consider how lexical stress and phrasal accent influence the acoustic realization of cues to phonological voicing in German plosives. 22 native speakers of Standard German were recorded producing a total of 3168 utterances in both strong (stressed/focused) and weak (unstressed/unfocused) prosodic contexts, while holding prosodic domain constant. Both Voice Onset Time (VOT) and obstruent-intrinsic F0 (CF0) were analyzed. We found that differences in the magnitude of CF0 between voiced and voiceless plosives were greatest in the strong prosodic context, but were not always obliterated in the weak prosodic context. However, individual differences were also observed, with speakers broadly patterning into four groups with respect to the interaction of micro- and macroprosody. VOT differences were also more pronounced in strong prosodic contexts. We consider the implications of our findings for sound changes involving the reanalysis of obstruent-intrinsic F0.
@inproceedings{ kirby2020effects,
author={James Kirby and Felicitas Kleber and Jessica Siddins and Jonathan Harrington},
title={{Effects of prosodic prominence on obstruent-intrinsic F0 and VOT in German}},
year=2020,
booktitle={Proc. 10th International Conference on Speech Prosody 2020},
pages={210--214},
doi={10.21437/SpeechProsody.2020-43},
url={http://dx.doi.org/10.21437/SpeechProsody.2020-43}
}
Acoustic correlates of plosive voicing in Madurese. Misnadin, James Kirby.
Journal of the Acoustical Society of America 147(4): 2779-2790. 2020.
Madurese, a Malayo-Polynesian language of Indonesia, is of interest both areally and typologically: it is described as having a three-way laryngeal contrast between voiced, voiceless unaspirated, and voiceless aspirated plosives, along with a strict phonotactic restriction on consonant voicing-vowel height sequences. We present an acoustic analysis of Madurese consonants and vowels obtained from recordings of fifteen speakers, to assess whether its voiced and aspirated plosives might share acoustic properties indicative of a shared articulatory gesture. Although we find that voiced and voiceless aspirated plosives in word-initial position pattern together in terms of several spectral balance measures, these are most likely due to the following vowel quality, rather than aspects of a shared laryngeal configuration. Conversely, the voiceless (aspirated and unaspirated) plosives share multiple acoustic properties, including F0 trajectories and overlapping voicing lag time distributions, suggesting that they share a glottal aperture target. We discuss the implications of these findings for the typology of laryngeal contrasts and the historical evolution of the Madurese consonant-vowel co-occurrence restriction.
@article{ misnadin2020acoustic,
author = {Misnadin and James Kirby},
doi = {10.1121/10.0000992},
journal = {Journal of the Acoustical Society of America},
number = {4},
pages = {2779--2790},
title = {Acoustic correlates of plosive voicing in {M}adurese},
volume = {147},
year = {2020}
}
2019
Effects of obstruent voicing on vowel F0: implications for laryngeal realism. James Kirby, D. Robert Ladd.
Yearbook of the Poznań Linguistic Meeting 4 (2018), 213-235. 2019.
It is sometimes argued that languages with two-way laryngeal contrasts can be classified according to whether one series is realized canonically with voicing lead or the other with voicing lag. In languages of the first type, such as French, the phonologically relevant feature is argued to be [voice], while in languages of the second type, such as German, the relevant feature is argued to be [spread glottis]. A crucial assumption of this position is that the presence of certain contextually stable phonetic cues, namely voicing lead or lag, can be used to diagnose which feature is phonologically active.
In this paper, we present data on obstruent-intrinsic F0 perturbations (CF0) in two [voice] languages, French and Italian. Voiceless obstruents in both languages are found to raise F0, while F0 following (pre)voiced obstruents patterns together with sonorants, similar to the voiceless unaspirated stops of [spread glottis] languages like German and English. The contextual stability of this cue implies that an active devoicing gesture is common to languages of both the [voice] and [spread glottis] types, and undermines the idea that a strict binary dichotomy between true voicing and aspirating languages can be reliably inferred based on properties of the surface phonetics.
@article{ kirby2019effects,
author = {James Kirby and D. Robert Ladd},
doi = {10.2478/yplm-2018-0009},
journal = {Yearbook of the Poznań Linguistic Meeting},
pages = {213--235},
year = {2019},
title = {Effects of obstruent voicing on vowel F0: implications for laryngeal realism},
volume = {4 (2018)}
}
Obstruent devoicing and registrogenesis in Chru. Marc Brunelle, Tạ Thành Tấn, James Kirby, Đinh Lư Giang.
Proceedings of the Nineteenth International Congress of Phonetic Sciences, Melbourne. 2019.
We describe the register system of Chru, a Chamic language of Vietnam. In Chru, a historical contrast between prevoiced and voiceless stops is now a system of two registers signalled by differences in f0, voice quality, and F1 in addition to closure voicing. However, closure voicing is in a state of flux: while older men maintain closure voicing in the onsets of low-register items, younger speakers and some older women frequently have no (or only weak) closure voicing in this context. In addition, the distribution of VOT in low register onsets is bimodal, realized either with strong closure voicing or greater VOT than voiceless stops. Interestingly, f0, F1 and voice quality cues are not enhanced after devoiced low-register stops, but instead are more pronounced after stops realized with closure voicing. We argue this indicates that enhancement of cues in phonologization must in some sense be complete before neutralization takes place.
@inproceedings{ brunelle2019obstruent,
author = {Marc Brunelle and Tạ Thành Tấn and James Kirby and Đinh Lư Giang},
year = {2019},
title = {Obstruent devoicing and registrogenesis in Chru},
editor = {Sasha Calhoun and Paola Escudero and Marija Tabain and Paul Warren},
pages = {517--521},
publisher = {Australasian Speech Science and Technology Association Inc.},
booktitle = {Proceedings of the 19th International Congress of Phonetic Sciences, Melbourne, Australia 2019},
address = {Canberra}
}
Perception of laryngeal contrast in Madurese. James Kirby, Misnadin.
Proceedings of the Nineteenth International Congress of Phonetic Sciences, Melbourne. 2019.
We investigate native speaker perception of cues to voiceless plosives in the Malayo-Polynesian language Madurese. Madurese is described as having a three-way laryngeal contrast between voiced, voiceless aspirated, and voiceless unaspirated plosives. However, voiceless aspirated and unaspirated plosives are always followed by vowels of different but predictable height, and their VOT distributions overlap heavily, raising the question of whether VOT or F1 is the primary perceptual cue to this contrast. The trading relation between VOT and F1 in Madurese was investigated using 2AFC identification and AXB discrimination paradigms. Results indicate that the VOT differences between voiceless plosives which exist in production are not exploited in perception, suggesting that Madurese speakers may not have distinct phonetic targets for aspirated and unaspirated plosives. The surface VOT distributions may instead be a result of differences in following vowel height.
@inproceedings{ kirby2019perception,
author = {James Kirby and Misnadin},
year = {2019},
title = {Perception of laryngeal contrast in Madurese},
editor = {Sasha Calhoun and Paola Escudero and Marija Tabain and Paul Warren},
pages = {2378--2382},
publisher = {Australasian Speech Science and Technology Association Inc.},
booktitle = {Proceedings of the 19th International Congress of Phonetic Sciences, Melbourne, Australia 2019},
address = {Canberra}
}
Acoustic analysis of onset voicing in Dzongkha obstruents. James Kirby, Gwendolyn Hyslop.
Proceedings of the Nineteenth International Congress of Phonetic Sciences, Melbourne. 2019.
We present an acoustic analysis of cues to onset voicing in Dzongkha, the national language of Bhutan. Dzongkha is typically described as having a four-way laryngeal contrast between aspirated, unaspirated, prevoiced and devoiced obstruents. Previous descriptions suggest that this system may be changing, with the devoiced series either merging with the voiced series, or losing closure voicing but retaining contrastive pitch and/or voice quality. Based on data from 12 speakers, we find voiced and devoiced plosives are realised both with and without voicing lead. Tokens realized as phonetically voiced can be redundantly breathy; however, a low register tone always occurs on syllables headed by both voiced and devoiced obstruents, regardless of presence or absence of voicing lead. We discuss the implications of these findings for models of tonogenesis and historical sound change in the Tibeto-Burman context.
@inproceedings{ kirby2019acoustic,
author = {James Kirby and Gwendolyn Hyslop},
year = {2019},
title = {Acoustic analysis of onset voicing in Dzongkha obstruents},
editor = {Sasha Calhoun and Paola Escudero and Marija Tabain and Paul Warren},
pages = {3607--3611},
publisher = {Australasian Speech Science and Technology Association Inc.},
booktitle = {Proceedings of the 19th International Congress of Phonetic Sciences, Melbourne, Australia 2019},
address = {Canberra}
}
An acoustic study of Zhajin Gan tone. Yingyi Zhou, James Kirby.
Proceedings of the Nineteenth International Congress of Phonetic Sciences, Melbourne. 2019.
This paper presents the first phonetic study of the tone system of Zhajin Gan. While important for our understanding of tonogenesis in general and Chinese historical phonology in particular, the tone systems of Gan dialects have not been analysed acoustically. Zhajin Gan can be analysed as having 6 or 7 tones in non-checked syllables, with most having two allotones conditioned by historical differences in onset type. However, as synchronic cues to the laryngeal contrast have weakened or disappeared, the previously redundant onset f0 differences are being phonologized, setting the stage for additional tone splits. In Zhajin Gan, plosives and affricates from MC Ciqing and Quanzhuo series are synchronically realized as lax voiced stops, correlating with lower onset f0, but without any evidence for synchronic aspiration. We discuss the possible role that non-modal phonation may have played in the evolution of this complex tone system.
@inproceedings{ zhou2019acoustic,
author = {Yingyi Zhou and James Kirby},
year = {2019},
title = {An acoustic study of Zhajin Gan tone},
editor = {Sasha Calhoun and Paola Escudero and Marija Tabain and Paul Warren},
pages = {1193--1197},
publisher = {Australasian Speech Science and Technology Association Inc.},
booktitle = {Proceedings of the 19th International Congress of Phonetic Sciences, Melbourne, Australia 2019},
address = {Canberra}
}
2018
Madurese. Misnadin, James Kirby.
Journal of the International Phonetic Association 50(1), 109-126. 2018 (2020).
@article{ misnadin2018madurese,
author = {Misnadin and James Kirby},
journal = {Journal of the International Phonetic Association},
year = {2020},
volume = {50},
number = {1},
pages = {109--126},
doi = {10.1017/S0025100318000257},
title = {Madurese}
}
Inducing a lexicon of sociolinguistic variables from code-mixed text. Philippa Shoemark, James Kirby, Sharon Goldwater.
Proceedings of the 2018 EMNLP Workshop W-NUT: The 4th Workshop on Noisy User-generated Text, 1-6. 2018.
Sociolinguistics is often concerned with how variants of a linguistic item (e.g., nothing vs. nothin’) are used by different groups or in different situations. We introduce the task of inducing lexical variables from code-mixed text: that is, identifying equivalence pairs such as (football, fitba) along with their linguistic code (football→British, fitba→Scottish). We adapt a framework for identifying gender-biased word pairs to this new task, and present results on three different pairs of English dialects, using tweets as the code-mixed text. Our system achieves precision of over 70% for two of these three datasets, and produces useful results even without extensive parameter tuning. Our success in adapting this framework from gender to language variety suggests that it could be used to discover other types of analogous pairs as well.
@inproceedings{ shoemark2018inducing,
title={Inducing a lexicon of sociolinguistic variables from code-mixed text},
booktitle={Proceedings of the 2018 EMNLP Workshop W-NUT: The 4th Workshop on Noisy User-generated Text},
author={Shoemark, Philippa and Kirby, James and Goldwater, Sharon},
editor={Wei Xu and Alan Ritter and Tim Baldwin and Afshin Rahimi},
year={2018},
publisher={Association for Computational Linguistics},
pages={1--6}
}
Onset pitch perturbations and the cross-linguistic implementation of voicing: Evidence from tonal and non-tonal languages. James Kirby.
Journal of Phonetics 71, 326-354. 2018.
This paper investigates the relationship between Voice Onset Time (VOT) and onset f0 perturbations in three languages with a three-way laryngeal contrast between prevoiced, short-lag, and long-lag stops. To assess the relative contributions of aspiration and tonality to the realization of onset f0, a non-tonal language (Khmer) is compared to two tonal languages (Central Thai and Northern Vietnamese) using a common set of methods and materials. While the VOT distributions of the three languages are extremely similar, they differ in terms of their onset f0 behavior. Aspirated stops in general condition higher f0 on the following vowel, but this effect is mediated by tonal and sentential context: it is more prominent in citation forms than in connected speech, and for the tone languages, it is more visible with higher as opposed to lower tones. Examination of individual differences suggests that speakers may differ systematically in terms of their laryngeal adjustments for expressing voicelessness even while maintaining similar timing relations as indicated by VOT. Onset f0 differences may serve as a useful complement to VOT, particularly when reasoning about the cross-linguistic implementation of voicing.
@article{ kirby2018onset,
author = {James P. Kirby},
journal = {Journal of Phonetics},
pages = {326--354},
volume = {71},
year = {2018},
title = {Onset pitch perturbations and the cross-linguistic implementation of voicing: Evidence from tonal and non-tonal languages}
}
Mixed-effects design analysis for experimental phonetics. James Kirby, Morgan Sonderegger.
Journal of Phonetics 70, 70-85. 2018.
It is common practice in the statistical analysis of phonetic data to draw conclusions on the basis of statistical significance. While p-values reflect the probability of incorrectly concluding a null effect is real, they do not provide information about other types of error that are also important for interpreting statistical results. In this paper, we focus on three measures related to these errors. The first, power, reflects the likelihood of detecting an effect that in fact exists. The second and third, Type M and Type S errors, measure the extent to which estimates of the magnitude and direction of an effect are inaccurate. We then provide an example of design analysis (Gelman & Carlin, 2014), using data from an experimental study on German incomplete neutralization, to illustrate how power, magnitude, and sign errors vary with sample and effect size. This case study shows how the informativity of research findings can vary substantially in ways that are not always, or even usually, apparent on the basis of a p-value alone. We conclude by repeating three recommendations for good statistical practice in phonetics from best practices widely recommended for the social and behavioral sciences: report all results; design studies which will produce high-precision estimates; and conduct direct replications of previous findings.
@article{ kirby2018mixed,
author = {James Kirby and Morgan Sonderegger},
journal = {Journal of Phonetics},
pages = {70--85},
volume = {70},
year = {2018},
title = {Mixed-effects design analysis for experimental phonetics},
}
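As a rough illustration of the design analysis described in the abstract above, the following Python sketch estimates power, the Type S (sign) error rate, and the Type M (exaggeration) ratio for a hypothetical effect. It is not the authors' code: it assumes a normally distributed estimate with a known standard error, and the function name `design_analysis` and all numeric values are invented for illustration rather than taken from the German incomplete neutralization data.

```python
import random
from statistics import NormalDist, mean


def design_analysis(true_effect, std_error, alpha=0.05, n_sims=20000, seed=1):
    """Return (power, type_s, type_m) for a hypothesized true effect.

    Assumption: the estimate is normally distributed around the true
    effect with a known standard error (a simplification for this sketch).
    """
    nd = NormalDist()
    z_crit = nd.inv_cdf(1 - alpha / 2)
    d = abs(true_effect) / std_error  # standardized effect; sign handled by symmetry

    # Probability of a significant estimate with the correct / wrong sign.
    p_correct_sign = 1 - nd.cdf(z_crit - d)
    p_wrong_sign = nd.cdf(-z_crit - d)
    power = p_correct_sign + p_wrong_sign
    type_s = p_wrong_sign / power

    # Type M: average magnitude of significant estimates relative to the truth,
    # approximated by simulating replications of the study.
    rng = random.Random(seed)
    estimates = (rng.gauss(abs(true_effect), std_error) for _ in range(n_sims))
    significant = [abs(e) for e in estimates if abs(e) > z_crit * std_error]
    type_m = mean(significant) / abs(true_effect) if significant else float("nan")
    return power, type_s, type_m


if __name__ == "__main__":
    # Hypothetical scenarios: same standard error, larger vs. smaller true effect.
    for effect in (2.0, 0.5):
        power, type_s, type_m = design_analysis(effect, std_error=1.0)
        print(f"effect={effect}: power={power:.3f}, "
              f"Type S={type_s:.4f}, Type M={type_m:.2f}")
```

Under these assumptions, the smaller hypothetical effect yields lower power and a larger exaggeration ratio among significant results, which is the pattern the abstract describes as not apparent from a p-value alone.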
Model selection and phonological argumentation. James Kirby, Morgan Sonderegger.
In Diane Brentari and Jackson Lee (eds.), Shaping Phonology (pp. 234-252). University of Chicago Press. 2018.
[ preprint |
bib
| abstract
]
Statistical and empirical methods are in widespread use in present-day phonological research. Researchers are often interested in the problem of model selection, or determining whether or not a particular term in a model is statistically significant, in order to make a judgement about whether or not that term is theoretically significant. If a term is not significant, it is often tempting to conclude that it is not relevant. However, such inferences require an assessment of statistical power, a dimension independent from significance. Assessing power is more difficult than assessing significance because it depends on factors including the true (or expected) effect size, sample size, and degree of noise. In this paper, we provide a non-technical introduction to the issue of power, illustrated with simulations based on experimental investigations of incomplete neutralization, to illustrate how not all null results are equally informative. In particular, depending on the statistical power, a non-significant result can either be uninformative, or reasonably interpreted as providing evidence consistent with a small or zero effect.
@InCollection{ kirby2017model,
booktitle = {Shaping phonology},
address = {Chicago},
author = {James Kirby and Morgan Sonderegger},
pages = {234--252},
publisher = {University of Chicago Press},
year = {2018},
title = {Model selection and phonological argumentation},
editor = {Diane Brentari and Jackson Lee}
}
2017
On the r>h shift in Kiên Giang Khmer. James Kirby, Đinh Lư Giang.
Journal of the Southeast Asian Linguistics Society 10(2), 66-85. 2017.
[ pdf | bib
| abstract | audio recordings | supplementary materials
]
This paper presents an acoustic and perceptual study of the r>h shift in the variety of Khmer spoken in Giồng Riềng district, Kiên Giang province, Vietnam. In Phnom Penh Khmer, /r/ is realized as [h] in syllable onsets and onset clusters, and accompanied by lowered pitch, breathiness, and in some cases a change in the quality of the following vowel. In Kiên Giang Khmer, the r>h shift is accompanied by pitch lowering, but without changes in aspiration or vowel quality, and spectral measures did not indicate substantial differences in voice quality. Consistent with their productions, users of this dialect appear to rely solely on differences in pitch to identify these lexical items. We discuss the implications of our findings for Khmer dialectology, mechanisms of sound change, and variation in the realization of rhotics more generally.
@article{ kirby2017shift,
title={On the r>h Shift in Kiên Giang Khmer},
journal={Journal of the Southeast Asian Linguistics Society},
author={Kirby, James and Đinh Lư Giang},
year={2017},
volume={10},
number={2},
pages={66--85}
}
Topic and audience effects on distinctively Scottish vocabulary usage in Twitter data. Philippa Shoemark, James Kirby, Sharon Goldwater.
Proceedings of the Workshop on Stylistic Variation (EMNLP 2017), 59-68. 2017.
[ pdf | bib
| abstract
]
Sociolinguistic research suggests that speakers modulate their language style in response to their audience. Similar effects have recently been claimed to occur in the informal written context of Twitter, with users choosing less region-specific and non-standard vocabulary when addressing larger audiences. However, these studies have not carefully controlled for the possible confound of topic: that is, tweets addressed to a broad audience might also tend towards topics that engender a more formal style. In addition, it is not clear to what extent previous results generalize to different samples of users. Using mixed-effects models, we show that audience and topic have independent effects on the rate of distinctively Scottish usage in two demographically distinct Twitter user samples. However, not all effects are consistent between the two groups, underscoring the importance of replicating studies on distinct user samples before drawing strong conclusions from social media data.
@inproceedings{ shoemark2017topic,
title={Topic and audience effects on distinctively Scottish vocabulary usage in Twitter data},
booktitle={Proceedings of the Workshop on Stylistic Variation (EMNLP 2017)},
author={Shoemark, Philippa and Kirby, James and Goldwater, Sharon},
year={2017},
publisher={Association for Computational Linguistics},
pages={59--68}
}
Southeast Asian tone in areal perspective. James Kirby, Marc Brunelle.
In Raymond Hickey (ed.), The Cambridge handbook of areal linguistics (pp. 703-731). Cambridge University Press. 2017.
[ pdf |
bib
| abstract
| Publisher's site]
In this chapter, we address the role of contact in the evolution of tone in mainland Southeast Asia (MSEA). We present an overview of the phonetic, phonological, and genetic characteristics of MSEA tone systems, emphasizing the rich variability of tonal realization found in the region. Next, we discuss the ways in which languages can become tonal, reviewing evidence for the spread of tone through contact as well as for the idea that much of the observed tonality on the ground in modern MSEA might be traced to a small number of ‘tonogenetic events’ rather than a large number of borrowings. In light of this discussion, we consider whether a re-evaluation of the notion of tone as a canonical indicator of ‘linguistic area’ more generally is warranted.
@InCollection{ kirby2017areal,
booktitle = {The Cambridge handbook of areal linguistics},
address = {Cambridge},
author = {James Kirby and Marc Brunelle},
pages = {703--731},
publisher = {Cambridge University Press},
year = {2017},
title = {Southeast Asian tone in areal perspective},
editor = {Raymond Hickey}
}
Laryngeal contrasts in the Tai dialect of Cao Bằng. Pittayawat Pittayaporn, James Kirby.
Journal of the International Phonetic Association 47(1), 65-85. 2017.
[ pdf |
bib
| abstract
| supplementary materials | publisher's site
]
The Tai dialect spoken in Cao Bằng province, Vietnam, is at an intermediate stage between tonal register split and the accompanying transphonologization of a voicing contrast into a dual-register tone system. While the initial sonorants have completely lost their historical voicing distinction and developed a six-way tonal contrast, the obstruent series still preserves the original voicing contrast, leaving the tonal split incomplete. This paper presents the first acoustic study of tones and onsets in Cao Bằng Tai. Although f0, VOT, and voice quality were all found to play a role in the system of laryngeal contrasts, the three speakers considered varied in terms of the patterns of acoustic cues used to distinguish between onset types, particularly the breathy voiced onset /b̤/. From the diachronic perspective, our findings may help to explain why the reflex of modal prevoiced stops (*b) can be either aspirated or unaspirated voiceless stops.
@article{ pittayaporn2017laryngeal,
journal = {Journal of the International Phonetic Association},
author = {Pittayawat Pittayaporn and James Kirby},
year = {2017},
title = {Laryngeal contrasts in the Tai dialect of Cao Bằng},
volume = {47},
number = {1},
pages = {65--85}
}
2016
Effects of obstruent voicing on vowel F0: evidence from "true voicing" languages. James Kirby, D. Robert Ladd.
Journal of the Acoustical Society of America 140(4), 2400-2411. 2016.
[ pdf |
bib
| abstract
| supplementary materials | publisher's site
]
This study investigates consonant-related F0 perturbations (“CF0”) in French and Italian by comparing the effects of voiced and voiceless obstruents on F0 to those of voiced sonorants. The voiceless obstruents /p f/ in both languages are found to have F0-raising properties similar to American English voiceless obstruents, while F0 following the (pre)voiced obstruents /b v/ in French and Italian patterns together with /m/, again similar to English [Hanson (2009). J. Acoust. Soc. Am. 125(1), 425–441]. In both languages, F0 is significantly depressed, relative to sonorants, during the closure for voiced obstruents, but cannot be differentiated from sonorants following the release of oral constriction. These findings are taken as support for a model on which F0 perturbations are fundamentally the result of laryngeal maneuvers initiated to sustain or inhibit phonation, regardless of other language-particular aspects of phonetic realization.
@article{ kirby2016effects,
author = {James Kirby and D. Robert Ladd},
doi = {10.1121/1.4962445},
journal = {Journal of the Acoustical Society of America},
number = {4},
pages = {2400--2411},
title = {Effects of obstruent voicing on vowel F0: evidence from "true voicing" languages},
volume = {140},
year = {2016}
}
Tone-melody correspondence in Vietnamese popular song. James Kirby, D. Robert Ladd.
In C. DiCanio et al. (eds.), The 5th International Symposium on Tonal Aspects of Languages (TAL2016), 48-51. 2016.
[ pdf |
bib
| abstract | supplementary materials | publisher's site
]
We examine the degree of correspondence between musical and linguistic tone sequences in a corpus of 20 Vietnamese popular songs. Our data suggest that text-setting constraints in Vietnamese, as in some other Asian tone languages, are based primarily on the direction of the pitch transition from one syllable/note to the next. Borrowing some musical terminology, we may say that ‘similar motion’ is favoured, ‘oblique motion’ is allowed in certain cases, and ‘contrary motion’ is disfavoured. As with other Asian tone languages, the definition of these three types of motion depends on the categorisation of the linguistic tones; our current best model achieves a rate of 77% similar motion. We hypothesize that avoidance of contrary motion may be as or more important than achieving correspondence between tonal and melodic transitions in Vietnamese popular song.
@incollection{ kirby2016tone,
address = {Buffalo, NY},
author = {James Kirby and D. Robert Ladd},
booktitle = {The 5th International Symposium on Tonal Aspects of Languages (TAL2016)},
doi = {10.21437/TAL.2016-10},
editor = {Christian DiCanio and Jeffrey Malins and Jeff Good and Karin Michelson and Jeri Jaeger and Holly Keily},
pages = {48--51},
title = {Tone-melody correspondence in Vietnamese popular song},
url = {http://dx.doi.org/10.21437/TAL.2016-10},
year = {2016}
}
Towards robust cross-linguistic comparisons of phonological networks. Philippa Shoemark, Sharon Goldwater, James Kirby, Rik Sarkar.
Proc. SIGMORPHON 14, 110-120. 2016.
[ pdf | bib
| abstract
]
Recent work has proposed using network science to analyse the structure of the mental lexicon by viewing words as nodes in a phonological network, with edges connecting words that differ by a single phoneme. Comparing the structure of phonological networks across different languages could provide insights into linguistic typology and the cognitive pressures that shape language acquisition, evolution, and processing. However, previous studies have not considered how statistics gathered from these networks are affected by factors such as lexicon size and the distribution of word lengths. We show that these factors can substantially affect the statistics of a phonological network and propose a new method for making more robust comparisons. We then analyse eight languages, finding many commonalities but also some qualitative differences in their lexicon structure.
@inproceedings{ shoemark2016robust,
title={Towards robust cross-linguistic comparisons of phonological networks},
booktitle={Proc. SIGMORPHON 14},
author={Shoemark, Philippa and Goldwater, Sharon and Kirby, James and Sarkar, Rik},
year={2016},
address={Berlin},
pages={110--120}
}
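The network construction summarised in the abstract above (words as nodes, edges between words that differ by a single phoneme) can be sketched in a few lines of Python. This is an illustrative sketch, not the authors' implementation: it assumes that "differ by a single phoneme" means one substitution, insertion, or deletion, and the toy lexicon, its transcriptions, and the function names are hypothetical.

```python
from itertools import combinations


def differ_by_one(a, b):
    """True if phoneme tuples a and b are one substitution, insertion,
    or deletion apart (assumed reading of 'differ by a single phoneme')."""
    if a == b:
        return False
    if len(a) == len(b):
        # Same length: exactly one position may differ (substitution).
        return sum(x != y for x, y in zip(a, b)) == 1
    if abs(len(a) - len(b)) != 1:
        return False
    short, long_ = (a, b) if len(a) < len(b) else (b, a)
    # Deleting one phoneme from the longer word must yield the shorter one.
    return any(long_[:i] + long_[i + 1:] == short for i in range(len(long_)))


def build_network(lexicon):
    """Map each word to the set of its phonological neighbours."""
    network = {word: set() for word in lexicon}
    for w1, w2 in combinations(lexicon, 2):
        if differ_by_one(lexicon[w1], lexicon[w2]):
            network[w1].add(w2)
            network[w2].add(w1)
    return network


def mean_degree(network):
    """Average number of neighbours per word."""
    return sum(len(nbrs) for nbrs in network.values()) / len(network)


# Hypothetical toy lexicon: orthographic word -> tuple of phonemes.
TOY_LEXICON = {
    "cat": ("k", "æ", "t"),
    "bat": ("b", "æ", "t"),
    "cap": ("k", "æ", "p"),
    "at": ("æ", "t"),
    "dog": ("d", "ɔ", "g"),
}

if __name__ == "__main__":
    net = build_network(TOY_LEXICON)
    for word, neighbours in net.items():
        print(word, sorted(neighbours))
    print("mean degree:", mean_degree(net))
```

Statistics such as mean degree depend on how many words the lexicon contains and how long they are, which is the kind of sensitivity the abstract warns about when comparing languages.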
Tone and phonation in Southeast Asian languages. Marc Brunelle, James Kirby.
Language and Linguistics Compass, 10(4), 191-207. 2016.
[ pdf | bib
| abstract | publisher's site
]
Southeast Asia is often considered a quintessential Sprachbund where languages from five different language phyla have been converging typologically for millennia. One of the common features shared by many languages of the area is tone: several major national languages of the region have large tone inventories and complex tone contours. In this paper, we suggest a more fine-grained view. We show that in addition to a large number of atonal languages, the tone languages of the region are actually far more diverse than usually assumed, and employ phonation type contrasts at least as often as pitch. Along the same lines, we argue that concepts such as tone and register, while descriptively useful, can obscure important underlying similarities and impede our understanding of the behavior of phonetic properties, typological regularities and diachrony. We finally draw the reader’s attention to some issues of current interest in the study of tone and phonation in Southeast Asia and describe some technical developments that are likely to allow researchers to address new lines of research in years to come.
@article{ brunelle2016tone,
journal = {Language and Linguistics Compass},
author = {Marc Brunelle and James Kirby},
year = {2016},
title = {Tone and phonation in Southeast Asian languages},
volume = {10},
number = {4},
pages = {191--207}
}
2015
Stop voicing and F0 perturbations: evidence from French and Italian. James Kirby, D. Robert Ladd.
Proceedings of the Eighteenth International Congress of Phonetic Sciences, Glasgow. 2015.
[ pdf |
bib
| abstract
]
We report new experimental evidence on consonant-induced F0 perturbations in two languages with prevoiced stops, French and Italian. A positive correlation between duration of voicing lead and F0 at the onset of post-release voicing is observed, consistent with the predictions of an automatic or biomechanical account of the source of this effect. While the findings do not strictly rule out a role for onset F0 as a controlled enhancement, they support the proposal that, if anything, the enhancement is of [-voice] or [stiff] rather than [+voice].
@incollection{ kirby2015stop,
author = {James Kirby and D. Robert Ladd},
year = {2015},
title = {Stop voicing and F0 perturbations: evidence from French and Italian},
booktitle = {Proceedings of the 18th International Congress of Phonetic Sciences},
address = {Glasgow}
}
Temporal and spectral properties of Madurese stops. Misnadin, Bert Remijsen, James Kirby.
Proceedings of the Eighteenth International Congress of Phonetic Sciences, Glasgow. 2015.
[ pdf |
bib
| abstract
]
Madurese is a language with a three-way laryngeal contrast and an unusual consonant-vowel co-occurrence restriction. We provide new data on the phonetic realisation of Madurese stops from a sample of 15 native speakers by examining VOT, f0 and two acoustic correlates of voice quality, H1*-H2* and H1*-A3*. Our data indicate that while f0 distinguishes voiced from voiceless (aspirated and unaspirated) stops, at least one voice quality measure contrasts voiced and voiceless aspirated stops with voiceless unaspirated stops, suggesting that the relationship between these features may be more complex than has previously been assumed. Madurese appears to be best described as a 'register system' of the Mon-Khmer type, albeit one in which pitch and voice quality are dissociated.
@incollection { misnadin2015temporal,
author = {Misnadin and James Kirby and Bert Remijsen},
year = {2015},
title = {Temporal and spectral properties of Madurese stops},
booktitle = {Proceedings of the 18th International Congress of Phonetic Sciences},
address = {Glasgow}
}
Bias and population structure in the actuation of sound change. James Kirby, Morgan Sonderegger.
arXiv:1507.04420 [physics]. 2015.
[ arXiv |
bib
| abstract
]
Why do human languages change at some times, and not others? We address this longstanding question from a computational perspective, focusing on the case of sound change. Sound change arises from the pronunciation variability ubiquitous in every speech community, but most such variability does not lead to change. Hence, an adequate model must allow for stability as well as change. Existing theories of sound change tend to emphasize factors at the level of individual learners promoting one outcome or the other, such as channel bias (which favors change) or inductive bias (which favors stability). Here, we consider how the interaction of these biases can lead to both stability and change in a population setting. We find that population structure itself can act as a source of stability, but that both stability and change are possible only when both types of bias are active, suggesting that it is possible to understand why sound change occurs at some times and not others as the population-level result of the interplay between forces promoting each outcome in individual speakers. In addition, if it is assumed that learners learn from two or more teachers, the transition from stability to change is marked by a phase transition, consistent with the abrupt transitions seen in many empirical cases of sound change. The predictions of multiple-teacher models thus match empirical cases of sound change better than the predictions of single-teacher models, underscoring the importance of modeling language change in a population setting.
@article{ kirby2015bias,
author = {James Kirby and Morgan Sonderegger},
year = {2015},
title = {Bias and population structure in the actuation of sound change},
journal = {arXiv:1507.04420 [physics]},
url = {http://arxiv.org/abs/1507.04420}
}
Re-assessing tonal diversity and geographical convergence in Mainland Southeast Asia. Marc Brunelle, James Kirby.
In N. J. Enfield and B. Comrie (eds.), Languages of Mainland Southeast Asia: The State of the Art (pp. 82-110). Mouton de Gruyter. 2015.
[ pdf |
bib
| abstract
| publisher's site
]
Mainland Southeast Asia (MSEA) is often described as the quintessential Sprachbund in which languages belonging to different language families converge as a result of contact. In this paper, we look in detail at the evidence for convergence of a specific phonological feature, tone, as expressed by two of its phonetic correlates, pitch and voice quality. Based on a database of 197 languages and dialects, we assess the extent of tonal diversity in MSEA languages and construct a statistical model of the degree to which tonal inventories can be predicted on the basis of geographic proximity, genealogical relatedness and population size. We find that the most robust predictors of tonality in MSEA languages are family and word shape.
@InCollection{brunelle2015reassessing,
booktitle = {Languages of Mainland Southeast Asia: the state of the art},
address = {Berlin},
author = {Marc Brunelle and James Kirby},
pages = {82--110},
publisher = {Mouton de Gruyter},
year = {2015},
title = {Re-assessing tonal diversity and geographical convergence in Mainland Southeast Asia},
editor = {Nick J. Enfield and Bernard Comrie}
}
2014
Acoustic transitions in Khmer word-initial clusters. James Kirby.
Proceedings of the 10th International Seminar on Speech Production, 234-237.
2014.
[ pdf |
bib
| abstract
| online proceedings
| poster version
| audio recordings
| supplementary materials
]
Onset clusters in Khmer (Cambodian) often appear with an acoustic transition between consonants, but the phonological status of these elements is indeterminate. If transitions result from gestural separation, they may disappear in fast speech. Acoustic analysis of data from 10 speakers shows that vocalic transitions in Khmer are found in a largely predictable set of consonantal contexts. While their presence is modulated by speech rate, they never disappear completely, in some cases becoming more rather than less frequent in fast speech. Clusters containing transitions are generally longer in duration than those that do not, and are also longer than monosyllables containing a lexical schwa, but the transitions do not show any spectral evidence of a distinct gestural target. The possible interpretations of these findings are discussed in the context of the range of articulatory variation known to occur in the implementation of speech rate.
@Inproceedings{ kirby2014transitions,
address = {Cologne},
author = {James Kirby},
booktitle = {Proceedings of the 10th International Seminar on Speech Production (ISSP)},
editor = {Susanne Fuchs and Martine Grice and Anne Hermes and Leonardo Lancia and Doris M\"ucke},
pages = {234--237},
title = {Acoustic transitions in Khmer word-initial clusters},
year = {2014}
}
Incipient tonogenesis in Phnom Penh Khmer: Acoustic and perceptual studies. James Kirby.
Journal of Phonetics 43, 69-85.
2014.
[ pdf |
bib
| abstract
| online journal
| supplementary materials
| audio recordings
| trills
]
Unlike many languages of Southeast Asia, Khmer (Cambodian) is not a tone language. However, in the colloquial speech of the capital Phnom Penh, /r/ is lost in onsets, reportedly supplanted by a range of other acoustic cues such as aspiration, a falling- or low-rising f0 contour, breathy voice quality, and in some cases diphthongization, e.g. /krɑː/ ‘poor’ > [kɔ̀ɑ], [kʰɔ̌ɑ], [kɔ̤ɑ̤], /kruː/ ‘teacher’ > [khùː], [kʰǔː], [kṳː]. This paper presents the results of production and perception studies designed to shed light on this unusual sound change. Acoustic evidence shows that colloquial /CrV/ forms differ from reading pronunciation forms in terms of VOT, f0, and spectral balance measures, while a pair of perceptual studies demonstrate that f0 is a sufficient cue for listeners to distinguish underlying /CrV/-initial from /CV/-initial forms, but that F1 is not. I suggest that this sound change may have arisen via the perceptual reanalysis of changes in spectral balance, coupled with the coarticulatory influence of the dorsal gesture for /r/.
@Article{ kirby2014incipient-acoustic,
journal = {Journal of Phonetics},
author = {James Kirby},
pages = {69--85},
year = {2014},
title = {Incipient tonogenesis in Phnom Penh Khmer: Acoustic and perceptual studies},
volume = {43}
}
Incipient tonogenesis in Phnom Penh Khmer: Computational studies. James Kirby.
Laboratory Phonology 5(1), 195-230.
2014.
[ pdf |
bib
| abstract
| online journal
| code
]
In the colloquial Phnom Penh dialect of Khmer (Cambodian), lexical use of F0 is emerging together with an intermediate VOT category and breathy phonation following the loss of /r/ in onsets (e.g. /kruː/ ‘teacher’ > [khṳ̀ː]). I show how this incipient tonogenesis might arise in a series of computational simulations tracing the evolution of multivariate phonetic category distributions in a population of ideal observers. Acoustic production data from a fieldwork study conducted in Phnom Penh was used as the starting point for the simulations. After establishing that the basic framework predicted relative stability over time, two possible responses to a phonetic production bias were considered: one in which agents correctly identified the source of (and thereby compensated for) the effects of the bias, and one in which agents misattributed the acoustic effects of the bias as a property of the onset. Good qualitative fits to the empirical production data were found for the latter group of learners, while the outcome for compensating learners resembled production data from a related dialect. These results are consistent with the sudden and discontinuous nature of many sound changes, and suggest that what appear to be enhancement effects may also emerge under different assumptions about the number of cue dimensions accessible to or deemed relevant by the learner.
@Article{kirby2014incipient-comp,
volume = {5},
journal = {Laboratory Phonology},
author = {James Kirby},
pages = {195--230},
year = {2014},
number = {1},
title = {Incipient tonogenesis in Phnom Penh Khmer: Computational studies}
}
Assessing incomplete neutralization of final devoicing in German. Timo Röttger, Bodo Winter, Sven Grawunder, James Kirby, Martine Grice.
Journal of Phonetics 43, 11-25. 2014.
[
pdf
| bib
| abstract
| online journal
]
It has been claimed that the long established neutralization of the voicing distinction in domain final position in German is phonetically incomplete. However, many studies that have advanced this claim have subsequently been criticized on methodological grounds, calling incomplete neutralization into question. In three production experiments and one perception experiment we address these methodological criticisms. In the first production study, we address the role of orthography. In a large scale auditory task using pseudowords, we confirm that neutralization is indeed incomplete and suggest that previous null results may simply be due to lack of statistical power. In two follow-up production studies (Experiments 2 and 3), we rule out a potential confound of Experiment 1, namely that the effect might be due to accommodation to the presented auditory stimuli, by manipulating the duration of the preceding vowel. While the between-items design (Experiment 2) replicated the findings of Experiment 1, the between-subjects version (Experiment 3) failed to find a statistically significant incomplete neutralization effect, although we found numerical tendencies in the expected direction. Finally, in a perception study (Experiment 4), we demonstrate that the subphonemic differences between final voiceless and “devoiced” stops are audible, but only barely so. Even though the present findings provide evidence for the robustness of incomplete neutralization in German, the small effect sizes highlight the challenges of investigating this phenomenon. We argue that without necessarily postulating functional relevance, incomplete neutralization can be accounted for by recent models of lexical organization.
@Article{ roettger2014assessing,
journal = {Journal of Phonetics},
volume = {43},
pages = {11--25},
author = {Timo B. Röttger and Bodo Winter and Sven Grawunder and James Kirby and Martine Grice},
year = {2014},
title = {Assessing incomplete neutralization of final devoicing in German}
}
Acquisition of covert contrast: an unsupervised learning approach. James Kirby.
In R. Baglini et al. (eds.), Proceedings of the 46th Annual Meeting of the Chicago Linguistic Society 46(2), 111-125. 2014 [2010].
[ pdf |
bib
| abstract
]
This paper explores the learnability of covert contrasts (impressionistically homophonous categories that can be reliably distinguished at the phonetic level) through a series of model-based clustering simulations. Allowing the models to learn both the number and parameters of those categories provides a way to explore the potential stability of category structures. The results indicate that while a statistical learner can be quite effective at inducing covert contrasts, success depends crucially on the number and distributional characteristics of the relevant cue dimensions.
@InProceedings{ kirby2010acquisition,
booktitle = {Proceedings of the 46th Annual Meeting of the Chicago Linguistic Society},
volume = {46},
author = {James Kirby},
year = {2014},
title = {Acquisition of covert contrast: an unsupervised learning approach},
number = {2},
pages = {111--125},
editor = {Rebekah Baglini and Tim Grinsell and Jonathan Keane and Adam Roth Singerman and Julia Thomas}
}
2013
A model of population dynamics applied to phonetic change. James Kirby, Morgan Sonderegger.
In Proceedings of the 35th Annual Conference of the Cognitive Science Society, 776-781. 2013.
[ pdf |
bib
| abstract
| corrigenda
]
We consider the problem of language evolution in a population setting, focusing on the case of continuous parameter learning. While theories of phonetic change tend to emphasize the types of transmission errors that could give rise to a shift in pronunciation norms, it is challenging to develop a model that allows for both stability as well as change. We model the acquisition of vowel-to-vowel coarticulation in both single- and multiple-teacher settings, considering progressively more restrictive prior learning biases. We demonstrate that both stability and change are possible at the population level, but only under fairly strong assumptions about the nature of learning and production biases.
@InProceedings{ kirby2013population,
address = {Austin, TX},
booktitle = {Proceedings of the 35th Annual Conference of the Cognitive Science Society},
author = {James Kirby and Morgan Sonderegger},
publisher = {Cognitive Science Society},
pages = {776--781},
year = {2013},
title = {A model of population dynamics applied to phonetic change},
editor = {M. Knauff and M. Pauen and N. Sebanz and I. Wachsmuth}
}
The role of probabilistic enhancement in phonologization. James Kirby.
In Alan Yu (ed.), Origins of sound change: approaches to phonologization (pp. 228-246). Oxford University Press. 2013.
[ pdf |
bib
| abstract
| corrigendum
]
This chapter argues for the role of probabilistic enhancement in phonologization through computational simulation of an ongoing sound change in Seoul Korean. Two challenges faced by a phonologization model of sound change are addressed: explaining which cues are selected for phonologization, and explaining why phonologization is often accompanied by dephonologization. It is proposed that cues are targeted for enhancement as a probabilistic function of their statistical reliability in signaling a contrast. Simulation results using empirically derived cue values are taken to support the idea that loss of contrast precision may drive the phonologization process.
@InCollection{kirby2013role,
booktitle = {Origins of sound change: approaches to phonologization},
address = {Oxford},
author = {James Kirby},
pages = {228--246},
publisher = {Oxford University Press},
year = {2013},
title = {The role of probabilistic enhancement in phonologization},
editor = {Alan Yu}
}
2012
Tracking the acquisition of L2 phonetic contrast. James Kirby, Alan Yu.
Poster presented at LabPhon 13, Stuttgart. 2012.
[ pdf |
bib
| abstract
]
This study investigated the persistence of phonetic cue restructuring in a naturalistic learning environment. 17 native English speaking L2 learners of Korean were tracked over an 8 week period to explore the time course of acquisition of novel phonological contrasts signaled by VOT and f0. Production and perception results suggest that learners can quickly learn to direct attention to a novel dimension even in the absence of explicit feedback, and that continued exposure has a small but significant impact on performance: participants were able to exert more accurate control over L2 phonetic dimensions over the course of the experiment.
@Misc{ kirby2012tracking,
note = {27 July 2012},
author = {James Kirby and Alan Yu},
year = {2012},
title = {Tracking the acquisition of L2 phonetic contrast},
howpublished = {Poster presented at LabPhon 13, Stuttgart}
}
2011
Modeling the acquisition of covert contrast. James Kirby.
In Proceedings of the Seventeenth International Congress of Phonetic Sciences, 1090-1093. 2011.
[ pdf |
bib
| abstract
]
This paper explores the learnability of covert contrasts (impressionistically homophonous categories that can be reliably distinguished at the phonetic level) through a series of model-based clustering simulations using human production data. Allowing the models to learn both the number and parameters of those categories provides a way to explore the potential stability of category structures. The results indicate that while a statistical learner can be quite effective at inducing covert contrasts, success depends crucially on the number and distributional characteristics of the relevant cue dimensions.
@InProceedings{ kirby2011modeling,
booktitle = {Proceedings of the Seventeenth International Congress of Phonetic Sciences},
author = {James Kirby},
pages = {1090--1093},
year = {2011},
title = {Modeling the acquisition of covert contrast},
editor = {Wai-Sum Lee and Eric Zee}
}
Multilingual learning with parameter co-occurrence clustering. Max Bane, Jason Riggle, James Kirby, John Sylak.
In Proceedings of the 39th Meeting of the North East Linguistic Society, 67-82. 2011.
[ pdf |
bib
| abstract
]
The computational task of language learning has long been a central issue in theoretical linguistics, and most work has focused on its monolingual formulation, in which the learner's sample is drawn from a single target language. This paper considers a minimal extension of the usual monolingual formulation to accommodate the multilingual setting, and presents a novel strategy for discriminating and learning languages within it by clustering grammatical properties according to their co-occurrence in the sample. The heuristic that we propose is generic in the sense that it is applicable within any parameterized linguistic theory for which it is feasible to compute the possible parameter-settings implied by observing a single input-output mapping; for purposes of concreteness and evaluation, we present the algorithm within the framework of Optimality Theory, using syllable structure grammars as a case study.
@InProceedings{ bane2011multilingual,
booktitle = {Proceedings of the 39th Meeting of the North East Linguistic Society},
address = {Amherst, MA},
author = {Max Bane and Jason Riggle and James Kirby and John Sylak},
pages = {67--82},
publisher = {GLSA},
year = {2011},
title = {Multilingual learning with parameter co-occurrence clustering},
editor = {S. Lima and K. Mullin and B. Smith}
}
A tone split in Taoping Qiang. James Kirby.
Paper presented at the 21st Meeting of the Southeast Asian Linguistics Society, Bangkok. 2011.
[ pdf |
bib
| abstract
]
Evans (2001a, 2001b) argues that modern Southern Qiang (SQ) developed tones through a somewhat typologically unusual pathway: after developing pitch accent from earlier lexical stress, the languages became increasingly ‘tone-prone’ following phonological reduction of syllables and the segmental inventory (Matisoff, 1998), developing tonal systems after heavy borrowing from Mandarin. Here, I suggest that otherwise phonologically conservative Taoping Qiang also shows evidence of more ‘traditional’ tonogenetic mechanisms, which may have conditioned a tone split from the original *H reflex.
@Misc{ kirby2011qiang,
note = {11 May 2011},
author = {James Kirby},
year = {2011},
title = {A tone split in Taoping Qiang},
howpublished = {Paper presented at the 21st Meeting of the Southeast Asian Linguistics Society, Bangkok}
}
Vietnamese (Hanoi Vietnamese). James Kirby.
Journal of the International Phonetic Association 41(3), 381-392. 2011.
[ pdf |
bib
| abstract
| online journal
]
Vietnamese, the official language of Vietnam, is spoken natively by over seventy-five million people in Vietnam and greater Southeast Asia as well as by some two million overseas, predominantly in France, Australia, and the United States. This IPA illustration gives an overview of the phonetics and phonology of the Hanoi dialect.
@Article{kirby2011vietnamese,
volume = {41},
journal = {Journal of the International Phonetic Association},
author = {James Kirby},
pages = {381--392},
year = {2011},
title = {Vietnamese (Hanoi Vietnamese)},
number = {3}
}
2010
Cue selection and category restructuring in sound change. James Kirby.
Ph.D. Dissertation, University of Chicago, 2010.
[ pdf |
bib
| abstract
]
Changes to the realization of phonetic cues, such as vowel length or voice onset time, can have differential effects on the system of phonological categories. In some cases, variability or bias in phonetic realization may cause a contrast between categories to collapse, while in other cases, the contrast may persist through the phonologization of a redundant cue (Hyman, 1976). The goals of this dissertation are to better understand the subphonemic conditions under which a contrast is likely to survive and when it is likely to collapse, as well as to understand why certain cues are more likely to be phonologized than others.
I explore these questions by considering the transmission of speech sounds over a noisy channel (Shannon and Weaver, 1948), hypothesizing that when the precision of a contrast along one acoustic dimension is reduced, other dimensions may be enhanced to compensate (the probabilistic enhancement hypothesis). Whether this results in phonologization or neutralization depends on both the degree to which the contrast is threatened and the informativeness of the cues that signal it.
In order to explore this hypothesis, phonological categories are modeled as finite mixtures, which provide a natural way to generate, classify, and cluster objects in a multivariate setting. These mixtures are then embedded in an agent-based simulation framework and used to simulate the ongoing process of phonologization of pitch in Seoul Korean (Silva, 2006a,b; Kang and Guion, 2008). The results demonstrate that adaptive enhancement can account for both cue selection and the appearance of cue trading in phonologization. Additional data from the incomplete neutralization of final voicing in Dutch (Warner, Jongman, Sereno and Kemps, 2004) are then used to show how variation in phonetic realization can influence the loss or maintenance of phonological categories. Together, these case studies illustrate how variation in production and perception of subphonemic cues can impact the system of phonological contrasts.
@PhDThesis{ kirby2010cue,
author = {James Kirby},
year = {2010},
title = {Cue selection and category restructuring in sound change},
school = {University of Chicago}
}
Dialect experience in Vietnamese tone perception. James Kirby.
Journal of the Acoustical Society of America 127(4), 3749-3757. 2010.
[ pdf |
bib
| abstract
| online journal
]
This study investigated the perceptual dimensions of tone in Vietnamese and the effect of dialect experience on listeners’ prelinguistic perception of tone. While Northern Vietnamese tones are cued by a combination of pitch and voice quality, Southern Vietnamese tones are purely pitch based. Thirty listeners from two Vietnamese dialects (10 Northern, 20 Southern) participated in a speeded AX discrimination task using Northern stimuli. The resulting reaction times were used to compute an INDSCAL multidimensional scaling solution and were submitted to hierarchical clustering analysis. While the analysis revealed a similar three-dimensional perceptual space structure for both listener groups, corresponding roughly to f0 offset, voice quality, and contour type, the relative salience of these dimensions varied by dialect: Southern listeners were more likely to confuse tones produced with nonmodal voice quality, whereas Northern listeners found tones with similar pitch excursions to be more confusable. The results of hierarchical clustering of the stimuli further support an analysis where low-level perceptual similarity is influenced by primary dialect experience.
@Article{kirby2010dialect,
volume = {127},
journal = {Journal of the Acoustical Society of America},
author = {James Kirby},
pages = {3749--3757},
year = {2010},
title = {Dialect experience in Vietnamese tone perception},
number = {4}
}
2009
Comparative-induced event measure relations. James Kirby.
Paper presented at the 83rd Annual Meeting of the Linguistic Society of America, San Francisco. 2009.
[ pdf |
bib
| abstract
]
In Vietnamese quantity comparison structures, differentials are prohibited from appearing phrase-internally. I argue this is because they are athematic measure phrases. However, this leads to a semantic type clash given the meaning of the comparative. I propose to resolve this by means of a comparative-induced event measure relation which type-shifts the predicate in the appropriate context. This relation is also shown to be active in English, suggesting that it may be a more general property of predicates cross-linguistically.
@Misc{ kirby2009comparative,
note = {10 January 2009},
author = {James Kirby},
year = {2009},
title = {Comparative-induced event measure relations},
howpublished = {Paper presented at the 83rd Annual Meeting of the Linguistic Society of America, San Francisco}
}
Morphological paradigm effects on vowel realization. James Kirby, Alan Yu.
University of Chicago ms. 2009.
[ pdf |
bib |
abstract
]
Previous studies have shown that phonetic variation can be lexically conditioned (Wright, 1997; Munson and Solomon, 2004; Munson, 2007; Scarborough, 2006). Morphological paradigms have also been implicated in phonetic variation (Steriade, 2000; Kuperman et al., 2007). This paper investigates the nature of morphological paradigm effects on vowel production in German verbs. We report the results of a production experiment showing that, while paradigmatic complexity affects vowel dispersion, the effect is mediated by word frequency.
@Misc{ kirby2009morphological,
author = {James Kirby and Alan Yu},
year = {2009},
title = {Morphological paradigm effects on vowel realization},
howpublished = {University of Chicago ms}
}
2008
vPhon: a Vietnamese phonetizer. James Kirby. 2008.
[ bib |
code
]
@Misc{kirby2007vphon,
author = {James Kirby},
year = {2008},
title = {vPhon: a Vietnamese phonetizer (version 0.2.4)},
howpublished = {Retrieved on (date) from http://lel.ed.ac.uk/~jkirby/vphon.html}
}
2007
Erculator: a Web application for constraint-based phonology. Jason Riggle, Max Bane, James Kirby, Ed King, Heather Rivers, John Sylak.
In University of Massachusetts Occasional Papers in Linguistics 36: Papers in Theoretical and Computational Phonology, 135-150. 2007.
[ bib
]
@InProceedings{riggle2007erculator,
booktitle = {University of Massachusetts Occasional Papers in Linguistics 36: Papers in Theoretical and Computational Phonology},
author = {Jason Riggle and Max Bane and James Kirby and Ed King and Heather Rivers and John Sylak},
pages = {135--150},
year = {2007},
title = {Erculator: a Web application for constraint-based phonology},
editor = {Michael Becker}
}
Lexical and phonotactic effects on wordlikeness judgments in Cantonese. James Kirby, Alan Yu.
In Proceedings of the Sixteenth International Congress of Phonetic Sciences, 1389-1392. 2007.
[ pdf |
bib
| abstract
| conference site | data and code
]
This paper reports the results of a wordlikeness task designed to investigate Cantonese speakers’ gradient phonotactic knowledge of systematic versus accidental phonotactic gaps. Regression analyses found that wordlikeness judgments correlate with token frequency-weighted neighborhood density and transitional (bigram) probability. This is suggested to be an effect of the relative phonological densities of the Cantonese and English lexica.
@InProceedings{ kirby2007lexical,
booktitle = {Proceedings of the Sixteenth International Congress of Phonetic Sciences},
author = {James Kirby and Alan Yu},
pages = {1389--1392},
year = {2007},
title = {Lexical and phonotactic effects on wordlikeness judgments in Cantonese}
}
2006
Intrusive vowels in Cruceño Spanish. Cindy Kilpatrick, Kathryn McGee, James Kirby.
Poster presented at the 4th Joint Meeting of the Acoustical Society of America and Acoustical Society of Japan, Honolulu. 2006.
[ pdf |
bib
]
@Misc{ kilpatrick2006intrusive,
note = {2 December 2006},
author = {Cindy Kilpatrick and Kathryn McGee and James Kirby},
year = {2006},
title = {Intrusive vowels in Cruceño Spanish},
howpublished = {Poster presented at the 4th Joint Meeting of the Acoustical Society of America and Acoustical Society of Japan, Honolulu}
}
Vietnamese and the structure of NP. James Kirby.
University of Chicago ms. 2006.
[ pdf |
bib
]
@Misc{ kirby2006vietnamese,
author = {James Kirby},
year = {2006},
title = {Vietnamese and the structure of NP},
howpublished = {University of Chicago ms}
}
This page was generated semi-automatically from my BibTeX file using modified scripts originally by Charles Sutton.