The aim of this work is to identify and analyze a set of challenges that are likely to be encountered when one embarks on fieldwork in linguistic communities that feature small, young, and/or non-standard languages with a goal to elicit big sets of rich data. For each challenge, we (i) explain its nature and implications, (ii) offer one or more examples of how it is manifested in actual linguistic communities, and (iii) where possible, offer recommendations for addressing it effectively. Our list of challenges involves static characteristics (e.g., absence of orthographic conventions and how it affects data collection), dynamic processes (e.g., speed of language change in small languages and how it affects longitudinal collection of big amo...
Advances in statistical machine learning encourage language-independent approaches to linguistic tec...
The COVID-19 pandemic has massively limited how linguists can collect data, and out of necessity, re...
Minority languages are underrepresented in linguistic research, and a possible reason for this is th...
The aim of this work is to identify and analyze a set of challenges that are likely to be encountere...
The aim of this work is to identify and analyze a set of challenges that are likely to be encountere...
Mobile communication tools and platforms provide various opportunities for users to interact over so...
Implicit or explicit in many discussions of language documentation is the assumption that the langua...
While many have been focussed on methodological issues of documenting endangered languages, others h...
The most important reasons for examining “non-standard data” with CL methods are the facts that this...
The Kamusi Project, a multilingual online dictionary website, has as one of its goals to document th...
While many have been focussed on methodological issues of documenting endangered languages, others h...
Item does not contain fulltextUnderstanding worldwide patterns of language diversity has long been a...
Vocal languages across the world are estimated to be approximately 6000, yet only a handful of them ...
The increasing availability of large digital corpora of cross-linguistic data is revolutionizing man...
Advances in statistical machine learning encourage language-independent approaches to linguistic tec...
Advances in statistical machine learning encourage language-independent approaches to linguistic tec...
The COVID-19 pandemic has massively limited how linguists can collect data, and out of necessity, re...
Minority languages are underrepresented in linguistic research, and a possible reason for this is th...
The aim of this work is to identify and analyze a set of challenges that are likely to be encountere...
The aim of this work is to identify and analyze a set of challenges that are likely to be encountere...
Mobile communication tools and platforms provide various opportunities for users to interact over so...
Implicit or explicit in many discussions of language documentation is the assumption that the langua...
While many have been focussed on methodological issues of documenting endangered languages, others h...
The most important reasons for examining “non-standard data” with CL methods are the facts that this...
The Kamusi Project, a multilingual online dictionary website, has as one of its goals to document th...
While many have been focussed on methodological issues of documenting endangered languages, others h...
Item does not contain fulltextUnderstanding worldwide patterns of language diversity has long been a...
Vocal languages across the world are estimated to be approximately 6000, yet only a handful of them ...
The increasing availability of large digital corpora of cross-linguistic data is revolutionizing man...
Advances in statistical machine learning encourage language-independent approaches to linguistic tec...
Advances in statistical machine learning encourage language-independent approaches to linguistic tec...
The COVID-19 pandemic has massively limited how linguists can collect data, and out of necessity, re...
Minority languages are underrepresented in linguistic research, and a possible reason for this is th...