A lightning talk delivered at the Library Research and Innovative Practice Forum, McKeldin Library, June 4, 2015. The tool described is available at http://www.github.com/jwestgard/csv-validate/.This lightning talk describes a Python script for the validation of CSV files against arbitrary sets of rules specified in a schema file. The motivation for creating the tool was that CSV (comma-separated values) files have become a de facto standard for moving data between systems, and for any sort of batch ingest process. But CSV data can be messy, and often there are problems that appear only when the data is being loaded, after it is out of the hands of the librarians who have created the data and into the hands of systems staff. The tool is ...
This Perl script connects to a given CSW catalog, downloads metadata posts as a XML file and parses ...
Because TDWG vocabularies change and grow as they are developed by the community, it is nearly impos...
The 2007 British Atmospheric Data Centre (BADC) Users Survey examined the skill base of the BADC’s u...
A simple, but opinionated metadata quality checker and fixer designed to work with CSVs in the DSpac...
Comma-separated-values (CSV) is a useful data serialization and sharing format. This talk introduces...
Validation Set: We have shared 3 CSV files containing human-annotated validation sets of our paper (...
31 pages including 2p of AnnexThe CSVM format is derived from CSV format and allows the storage of t...
Tato práce se zaměřuje na tvorbu validátoru Comma-separated values (CSV) souborů podle W3C doporučen...
In 2016, the University of Waterloo began offering a mediated copyright review and deposit service t...
CSVM (CSV with Metadata) is a simple file format for tabular data. The possible application domain i...
This dataset contains two files: original_data.zip, and website_5folds.zip original_data.zip will u...
Over the last few years, the emphasis on data quality has evolved from being a nice to have to an ab...
Comma-Separated Values (CSV) files are commonly used to publish data about environmental phenomena a...
This talk will present a new data management IDE for CSV and other formats that provides functionali...
This fileset contains a preprint version of the conference paper (.pdf), presentation slides (as .pp...
This Perl script connects to a given CSW catalog, downloads metadata posts as a XML file and parses ...
Because TDWG vocabularies change and grow as they are developed by the community, it is nearly impos...
The 2007 British Atmospheric Data Centre (BADC) Users Survey examined the skill base of the BADC’s u...
A simple, but opinionated metadata quality checker and fixer designed to work with CSVs in the DSpac...
Comma-separated-values (CSV) is a useful data serialization and sharing format. This talk introduces...
Validation Set: We have shared 3 CSV files containing human-annotated validation sets of our paper (...
31 pages including 2p of AnnexThe CSVM format is derived from CSV format and allows the storage of t...
Tato práce se zaměřuje na tvorbu validátoru Comma-separated values (CSV) souborů podle W3C doporučen...
In 2016, the University of Waterloo began offering a mediated copyright review and deposit service t...
CSVM (CSV with Metadata) is a simple file format for tabular data. The possible application domain i...
This dataset contains two files: original_data.zip, and website_5folds.zip original_data.zip will u...
Over the last few years, the emphasis on data quality has evolved from being a nice to have to an ab...
Comma-Separated Values (CSV) files are commonly used to publish data about environmental phenomena a...
This talk will present a new data management IDE for CSV and other formats that provides functionali...
This fileset contains a preprint version of the conference paper (.pdf), presentation slides (as .pp...
This Perl script connects to a given CSW catalog, downloads metadata posts as a XML file and parses ...
Because TDWG vocabularies change and grow as they are developed by the community, it is nearly impos...
The 2007 British Atmospheric Data Centre (BADC) Users Survey examined the skill base of the BADC’s u...