Educational data mining involves the application of data mining techniques to student activity. However, in the context of computer programming, many data mining techniques can not be applied because they require vector-shaped input, whereas computer programs have the form of syntax trees. In this paper, we present ast2vec, a neural network that maps Python syntax trees to vectors and back, thereby enabling about a hundred data mining techniques that were previously not applicable. Ast2vec has been trained on almost half a million programs of novice programmers and is designed to be applied across learning tasks without re-training, meaning that users can apply it without any need for deep learning. We demonstrate the generality of ast2vec ...
INST: L_042This thesis is related to Educational data mining. This project is implemented in the Pyt...
Educational data mining and learning analytics promise better understanding of student behavior and ...
Currently, the data recorded in the educational context pres-ent two main challenges to data mining ...
Source code mining has received increasing attention, among which code classification plays a signif...
[[abstract]]From traditional face-to-face courses, asynchronous distance learning, synchronous live ...
Program code analysis is an important component in several kinds of intelligent educational systems ...
Deep neural networks have made significant break-throughs in many fields of artificial intelligence....
Methods to encode high-dimensional features into dense vectors or “embeddings” have gained attention...
[[abstract]]Design and analysis of algorithms are a fundamental topic in computer science and engine...
An effective and efficient code representation is critical to the success of sequence-to-sequence de...
Abstract—Deep learning has made significant breakthroughs in various fields of artificial intelligen...
In this work, we propose a new methodology to profile individual students of computer science based ...
A code generation system generates programming language code based on an input natural language desc...
Due to the increase in student numbers, the amount of time teachers have for each individual decreas...
Educational data mining and learning analytics promise better understanding of student behavior and ...
INST: L_042This thesis is related to Educational data mining. This project is implemented in the Pyt...
Educational data mining and learning analytics promise better understanding of student behavior and ...
Currently, the data recorded in the educational context pres-ent two main challenges to data mining ...
Source code mining has received increasing attention, among which code classification plays a signif...
[[abstract]]From traditional face-to-face courses, asynchronous distance learning, synchronous live ...
Program code analysis is an important component in several kinds of intelligent educational systems ...
Deep neural networks have made significant break-throughs in many fields of artificial intelligence....
Methods to encode high-dimensional features into dense vectors or “embeddings” have gained attention...
[[abstract]]Design and analysis of algorithms are a fundamental topic in computer science and engine...
An effective and efficient code representation is critical to the success of sequence-to-sequence de...
Abstract—Deep learning has made significant breakthroughs in various fields of artificial intelligen...
In this work, we propose a new methodology to profile individual students of computer science based ...
A code generation system generates programming language code based on an input natural language desc...
Due to the increase in student numbers, the amount of time teachers have for each individual decreas...
Educational data mining and learning analytics promise better understanding of student behavior and ...
INST: L_042This thesis is related to Educational data mining. This project is implemented in the Pyt...
Educational data mining and learning analytics promise better understanding of student behavior and ...
Currently, the data recorded in the educational context pres-ent two main challenges to data mining ...