Code stylometry applies analysis techniques to a collection of source code or binaries to determine variations in style. The extracted variations are often used to identify the author of the text or to differentiate one piece from another. In this research, we created a multi-input deep learning model that could accurately categorize and group code from multiple projects. The model took three inputs: word-based tokenization of code comments, character-based tokenization of the source code text, and the metadata features described by A. Caliskan-Islam et al. Using these three inputs, we achieved 90% validation accuracy with a loss value of 0.1203 on 12 projects consisting of 5,877 files. Finally, we ...
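The two tokenization inputs named in the abstract above can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the regular expression, the reserved ids (0 for padding, 1 for unknown tokens), and the fixed sequence length are all assumptions made for illustration, and the metadata features are left out.

```python
import re
from typing import Dict, List

PAD_ID = 0  # assumed id for padding positions
UNK_ID = 1  # assumed id for tokens missing from the vocabulary


def word_tokens(comment_text: str) -> List[str]:
    """Word-based tokenization: lowercase runs of letters, digits, underscores."""
    return re.findall(r"[a-z0-9_]+", comment_text.lower())


def char_tokens(code_text: str) -> List[str]:
    """Character-based tokenization: every character is its own token."""
    return list(code_text)


def build_vocab(token_lists: List[List[str]]) -> Dict[str, int]:
    """Map each distinct token to an integer id, starting after the reserved ids."""
    distinct = sorted({tok for tokens in token_lists for tok in tokens})
    return {tok: idx for idx, tok in enumerate(distinct, start=2)}


def encode(tokens: List[str], vocab: Dict[str, int], max_len: int) -> List[int]:
    """Convert tokens to ids, then truncate or right-pad to max_len."""
    ids = [vocab.get(tok, UNK_ID) for tok in tokens[:max_len]]
    return ids + [PAD_ID] * (max_len - len(ids))


if __name__ == "__main__":
    # Hypothetical comment and code strings for one file.
    comment = "Compute the running total"
    code = "total += x"
    word_vocab = build_vocab([word_tokens(comment)])
    char_vocab = build_vocab([char_tokens(code)])
    print(encode(word_tokens(comment), word_vocab, 8))
    print(encode(char_tokens(code), char_vocab, 16))
```

Each file would then yield two fixed-length integer sequences, one per input branch. A multi-input model would typically embed each sequence separately before combining them with the metadata features.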
The analysis in this project uses the CRISP-DM method. Smart Property Valuation (SPV) is a fic...
The advent of next-generation sequencing (NGS) technology has shown unprecedented promise for accura...
With the increasing demand placed on online systems by users, many organizations and companies are s...
Machine learning is the new frontier for technology development in geosciences and has developed ext...
This MQP was created for professors in the computer science department – specifically those teaching...
Research into the combination of data mining and machine learning technology with web-based educatio...
Dissertation presented as the partial requirement for obtaining a Master's degree in Data Science a...
The article of record as published may be found at https://doi.org/10.1016/j.physd.2021.132955. Recent...
The representation of the 2-Satisfiability problem (2SAT) is increasingly viewed as a significant logic...
Studies have shown that topic modeling with Latent Dirichlet Allocation (LDA) is a useful (semi-)uns...
In this thesis, the perturbation-based decomposition technique developed by Szlavik [1] was used in ...
This paper explores current processes in archival appraisal and selection and investigates the poten...
Approximate Arithmetic is a task that requires one to approximate the number of dots in dot arrays t...
This exploratory study applies social network analysis techniques to existing, publicly available da...
Knowledge graphs provide machines with structured knowledge of the world. Structured, machine-readab...