International audienceIn this work, we evaluate the performance of recent text embeddings for the automatic detection of events in a stream of tweets. We model this task as a dynamic clustering problem.Our experiments are conducted on a publicly available corpus of tweets in English and on a similar dataset in French annotated by our team. We show that recent techniques based on deep neural networks (ELMo, Universal Sentence Encoder, BERT, SBERT), although promising on many applications, are not very suitable for this task. We also experiment with different types of fine-tuning to improve these results on French data. Finally, we propose a detailed analysis of the results obtained, showing the superiority of tf-idf approaches for this task....
In this work, we present an event detection method in Twitter based on clustering of hashtags and in...
In the latest years, the Web has shifted from a read-only medium where most users could only consume...
Empiriical thesis.Bibliography: pages 107-122.1. Introduction -- 2. Literature review -- 3. TwitterN...
International audienceIn this work, we evaluate the performance of recent text embeddings for the au...
We present Event2018, a corpus annotated for event detection tasks, consisting of 38 million tweets ...
International audienceMultiword expression (MWE) identification in tweets is a complex task due to t...
In this paper we show how the performance of tweet clustering can be improved by leveraging characte...
Event detection from social media messages is conventionally based on clustering the message content...
The proliferation of social media and user-generated content in the Web has opened new opportunities...
Social media are full of information, which can be useful, of interest. However, the large amount of...
This paper aims to enhance event detection methods in a micro-blogging platform, namely Twitter. The...
International audienceThis articles describes the methods developed by the TWEETANEUSE team for the ...
In recent years, substantial research efforts have gone into investigating different approaches to t...
International audienceEvent detection on Twitter has become an attractive and challenging research f...
In the latest years, the Web has shifted from a read-only medium where most users could only consume...
In this work, we present an event detection method in Twitter based on clustering of hashtags and in...
In the latest years, the Web has shifted from a read-only medium where most users could only consume...
Empiriical thesis.Bibliography: pages 107-122.1. Introduction -- 2. Literature review -- 3. TwitterN...
International audienceIn this work, we evaluate the performance of recent text embeddings for the au...
We present Event2018, a corpus annotated for event detection tasks, consisting of 38 million tweets ...
International audienceMultiword expression (MWE) identification in tweets is a complex task due to t...
In this paper we show how the performance of tweet clustering can be improved by leveraging characte...
Event detection from social media messages is conventionally based on clustering the message content...
The proliferation of social media and user-generated content in the Web has opened new opportunities...
Social media are full of information, which can be useful, of interest. However, the large amount of...
This paper aims to enhance event detection methods in a micro-blogging platform, namely Twitter. The...
International audienceThis articles describes the methods developed by the TWEETANEUSE team for the ...
In recent years, substantial research efforts have gone into investigating different approaches to t...
International audienceEvent detection on Twitter has become an attractive and challenging research f...
In the latest years, the Web has shifted from a read-only medium where most users could only consume...
In this work, we present an event detection method in Twitter based on clustering of hashtags and in...
In the latest years, the Web has shifted from a read-only medium where most users could only consume...
Empiriical thesis.Bibliography: pages 107-122.1. Introduction -- 2. Literature review -- 3. TwitterN...