Bug fixes Prioritize module.builder_kwargs over defaults in TestCommand #3672 (@lvwerra) Fix TestCommand to copy dataset_infos to local dir with only data files #3680 (@albertvillanova) Upgrade black to version ~=22.0 #3691 (@LysandreJik) Fix streaming for servers not supporting HTTP range requests #3689 (@albertvillanova) Pin ElasticSearch #3701 (@lhoestq) Fix ClassLabel to/from dict when passed names_file #3695 (@albertvillanova) Fix CI code quality issue #3710 (@albertvillanova) Check if indices values in Dataset.select are within bounds #3719 (@mariosasko) Pin pandas to avoid bug in streaming mode #3725 (@albertvillanova) Use config pandas version in CSV dataset builder #3726 (@albertvillanova) Fix dataset mirroring (@lhoestq) Fix Valu...
Bug fixes Fix filter indices when batched by @albertvillanova in https://github.com/huggingface/dat...
Dataset changes Update: Adapt all audio datasets #3081 (@patrickvonplaten) Bug fixes Update BibTe...
Improvements Make decoding of Audio and Image feature optional by @mariosasko in https://github.com...
Bug fixes Fix streaming datasets that are not reset correctly by @lhoestq in https://github.com/hug...
Datasets Changes New: Add Russian SuperGLUE #2668 (@slowwavesleep) New: Add Disfl-QA #2473 (@bha...
Bug fixes Fix MP3 resampling when a dataset's audio files have different sampling rates by @lhoestq...
Datasets fixes Fix: irc_disentangle - fix checksum and bug dataset by @albertvillanova in https://g...
Datasets Changes New: Microsoft CodeXGlue Datasets #2357 (@ncoop57) New: KLUE benchmark #2416 (@...
Bug fixes Fix import datasets on python 3.10 by @lhoestq in https://github.com/huggingface/datasets...
Dataset changes Update: LexGLUE and MultiEURLEX README - update dataset cards #3075 (@iliaschalki...
Bug fixes Fix patching module that doesn't exist by @lhoestq in https://github.com/huggingface/data...
Datasets Changes New: C4 #2575 #2592 (@lhoestq) New: mC4 #2576 (@lhoestq) New: MasakhaNER #2465...
Datasets bug fixes Fix cnn_dailymail (dm stories were ignored) by @lhoestq in https://github.com/hu...
Bug fixes Fix double dots in data files by @lhoestq in https://github.com/huggingface/datasets/pull...
Dataset Changes New: NLU evaluation data #2238 (@dkajtoch) New: Add SLR32, SLR52, SLR53 to OpenS...
Bug fixes Fix filter indices when batched by @albertvillanova in https://github.com/huggingface/dat...
Dataset changes Update: Adapt all audio datasets #3081 (@patrickvonplaten) Bug fixes Update BibTe...
Improvements Make decoding of Audio and Image feature optional by @mariosasko in https://github.com...
Bug fixes Fix streaming datasets that are not reset correctly by @lhoestq in https://github.com/hug...
Datasets Changes New: Add Russian SuperGLUE #2668 (@slowwavesleep) New: Add Disfl-QA #2473 (@bha...
Bug fixes Fix MP3 resampling when a dataset's audio files have different sampling rates by @lhoestq...
Datasets fixes Fix: irc_disentangle - fix checksum and bug dataset by @albertvillanova in https://g...
Datasets Changes New: Microsoft CodeXGlue Datasets #2357 (@ncoop57) New: KLUE benchmark #2416 (@...
Bug fixes Fix import datasets on python 3.10 by @lhoestq in https://github.com/huggingface/datasets...
Dataset changes Update: LexGLUE and MultiEURLEX README - update dataset cards #3075 (@iliaschalki...
Bug fixes Fix patching module that doesn't exist by @lhoestq in https://github.com/huggingface/data...
Datasets Changes New: C4 #2575 #2592 (@lhoestq) New: mC4 #2576 (@lhoestq) New: MasakhaNER #2465...
Datasets bug fixes Fix cnn_dailymail (dm stories were ignored) by @lhoestq in https://github.com/hu...
Bug fixes Fix double dots in data files by @lhoestq in https://github.com/huggingface/datasets/pull...
Dataset Changes New: NLU evaluation data #2238 (@dkajtoch) New: Add SLR32, SLR52, SLR53 to OpenS...
Bug fixes Fix filter indices when batched by @albertvillanova in https://github.com/huggingface/dat...
Dataset changes Update: Adapt all audio datasets #3081 (@patrickvonplaten) Bug fixes Update BibTe...
Improvements Make decoding of Audio and Image feature optional by @mariosasko in https://github.com...