The users of the most widespread Software Engineering dedicated forum, Stack Overflow (SO), are confronted by the issue of posting duplicate questions and spending time waiting for an answer. Currently, only the SO users with a high reputation and the moderators manually determine this type of post. Hence, an automatic solution can save substantial time and work. As a solution, we propose a system split into three components.First, the textual information component is an ML-based solution to decide whether a question pair is a duplicate or not by analyzing its encoded version. Additionally, we use the Doc2Vec model for question embedding, which considers the title and body as input. As a second feature, we build a tag analyzer. Lastly, we i...
User queries on Stack Overflow commonly suffer from either inadequate length or inadequate\nclarity ...
Motivation: Document similarity metrics such as PubMed’s “Find related articles ” feature, which hav...
Dataset-1: Question-answer pairs on “Lucene” collected from StackOverflow. As mentioned in paper [36...
Abstract Stack Overflow is a popular on-line question and answer site for software developers to sha...
Duplicate questions on Stack Overflow are questions that are flagged as being conceptually equivalen...
Community based question answering forums are very popular these days. People tend to refer communi...
Community-based Question Answering (CQA) websites are attracting increasing numbers of users and con...
There has a been a significant rise in the use of Community Question Answering sites (CQAs) over the...
© 2018 Dr. Doris HoogeveenCommunity question-answering (cQA) sites are websites that people visit to...
In this paper we introduce the task of misflagged duplicate question detection for question pairs in...
Programming question and answer (Q&A) websites, such as Quora, Stack Overflow, and Yahoo! Answer etc...
In community question answering (CQA), duplicate questions are questions that were previously create...
With the massive volume of text available online these days, text categorization has become a very u...
StackOverflow is a very popular Q&A website, known to all software developers. Developers can ei...
Community question and answer (CQA) sites mostly involve knowledge-bases feeding into their automate...
User queries on Stack Overflow commonly suffer from either inadequate length or inadequate\nclarity ...
Motivation: Document similarity metrics such as PubMed’s “Find related articles ” feature, which hav...
Dataset-1: Question-answer pairs on “Lucene” collected from StackOverflow. As mentioned in paper [36...
Abstract Stack Overflow is a popular on-line question and answer site for software developers to sha...
Duplicate questions on Stack Overflow are questions that are flagged as being conceptually equivalen...
Community based question answering forums are very popular these days. People tend to refer communi...
Community-based Question Answering (CQA) websites are attracting increasing numbers of users and con...
There has a been a significant rise in the use of Community Question Answering sites (CQAs) over the...
© 2018 Dr. Doris HoogeveenCommunity question-answering (cQA) sites are websites that people visit to...
In this paper we introduce the task of misflagged duplicate question detection for question pairs in...
Programming question and answer (Q&A) websites, such as Quora, Stack Overflow, and Yahoo! Answer etc...
In community question answering (CQA), duplicate questions are questions that were previously create...
With the massive volume of text available online these days, text categorization has become a very u...
StackOverflow is a very popular Q&A website, known to all software developers. Developers can ei...
Community question and answer (CQA) sites mostly involve knowledge-bases feeding into their automate...
User queries on Stack Overflow commonly suffer from either inadequate length or inadequate\nclarity ...
Motivation: Document similarity metrics such as PubMed’s “Find related articles ” feature, which hav...
Dataset-1: Question-answer pairs on “Lucene” collected from StackOverflow. As mentioned in paper [36...