A large scale collection of both semantic and natural language resources is essential to leverage active Software Engineering research areas such as code reuse and code comprehensibility. Existing machine learning models ingest data from Open Source repositories (like GitHub projects) and forum discussions(like Stackoverflow.com), whereas, in this showcase, we took a step backward to orchestrate a corpus titled PyTorrent that contains 218,814 Python package libraries for the first time and collected from PyPI and Anaconda environment. This is because earlier studies have shown that much of the code is redundant and Python packages from these environments are better in quality and are well-documented. PyTorrent enables users (such as data sc...
This upload contains datasets and pre-trained models used for the paper Neural Code Search Revisited...
Machine learning is a general-purpose technology holding promises for many interdisciplinary researc...
In the thesis we compare the systems for data mining that have an interface in the programming langu...
A large scale collection of both semantic and natural language resources is essential to leverage ac...
A raw code corpus for the Python programming language i.e., includes only the Python source files of...
This repository contains the dataset of the manuscript: "An Empirical Study on the Usage and Availa...
pyOpenSci (short for Python Open Science), funded by the Alfred P. Sloan Foundation, is building a d...
We present Code4ML: a Large-scale Dataset of annotated Machine Learning Code, a corpus of Python cod...
Open source Python modules, linguistic data and documentation for research and development in natura...
International audienceWe introduce pycobra, a Python library devoted to ensemble learning (regressio...
Python programming for Data Scientists Preface Python programming language is an open source progr...
We introduce a toolkit for working with the 13.6 million volume Extracted Features Dataset from the ...
Python programming language plays a crucial role in machine learning. Python's syntax is straightfor...
There is a good reason why more Americans have searched for Python than for Kim Kardashian in the la...
Software repositories contain historical and valuable information about the overall development of s...
This upload contains datasets and pre-trained models used for the paper Neural Code Search Revisited...
Machine learning is a general-purpose technology holding promises for many interdisciplinary researc...
In the thesis we compare the systems for data mining that have an interface in the programming langu...
A large scale collection of both semantic and natural language resources is essential to leverage ac...
A raw code corpus for the Python programming language i.e., includes only the Python source files of...
This repository contains the dataset of the manuscript: "An Empirical Study on the Usage and Availa...
pyOpenSci (short for Python Open Science), funded by the Alfred P. Sloan Foundation, is building a d...
We present Code4ML: a Large-scale Dataset of annotated Machine Learning Code, a corpus of Python cod...
Open source Python modules, linguistic data and documentation for research and development in natura...
International audienceWe introduce pycobra, a Python library devoted to ensemble learning (regressio...
Python programming for Data Scientists Preface Python programming language is an open source progr...
We introduce a toolkit for working with the 13.6 million volume Extracted Features Dataset from the ...
Python programming language plays a crucial role in machine learning. Python's syntax is straightfor...
There is a good reason why more Americans have searched for Python than for Kim Kardashian in the la...
Software repositories contain historical and valuable information about the overall development of s...
This upload contains datasets and pre-trained models used for the paper Neural Code Search Revisited...
Machine learning is a general-purpose technology holding promises for many interdisciplinary researc...
In the thesis we compare the systems for data mining that have an interface in the programming langu...