Loading [MathJax]/jax/output/SVG/config.js
Proceedings of the Institute for System Programming of the RAS
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Proceedings of ISP RAS:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Proceedings of the Institute for System Programming of the RAS, 2020, Volume 32, Issue 4, Pages 165–174
DOI: https://doi.org/10.15514/ISPRAS-2020-32(4)-12
(Mi tisp532)
 

This article is cited in 1 scientific paper (total in 1 paper)

Two step method for grouping news with similar topics

K. A. Skorniakovab, A. S. Laskinaab, D. Yu. Turdakovbc

a Moscow Institute of Physics and Technology
b Ivannikov Institute for System Programming of the Russian Academy of Sciences
c Lomonosov Moscow State University
Full-text PDF (399 kB) Citations (1)
References:
Abstract: Amount of news is rapidly growing up in recent years. People cannot handle them effectively. This is the main reason why automatic methods of news stream analysis have become an important part of modern science. The paper is devoted to the part of the news stream analysis which is called “event detection”. “Event” is a group of news dedicated to one real-world event. We study news from Russian news agencies. We consider this task as clusterization on news and compare algorithms by external clusterization metrics. The paper introduces a novel approach to detect events at news in Russian language. We propose a two-staged clustering method. It comprises “rough” clustering algorithm at the first stage and clarifying classifier at the second stage. At the first stage, a combination of shingles method and naive named entity based clusterization is used. Also we present a labeled dataset of news event detection based on «Yandex News» service. This manually labeled dataset can be used to estimate event detection methods performance. Empirical evaluation on these corpora proved the effectiveness of the proposed method for event detection at news texts.
Keywords: event detection, clustering, news.
Funding agency Grant number
Russian Foundation for Basic Research 18-07-01059
This work was supported by a grant from the Russian Foundation For Basic Research No18-07-01059
Document Type: Article
Language: Russian
Citation: K. A. Skorniakov, A. S. Laskina, D. Yu. Turdakov, “Two step method for grouping news with similar topics”, Proceedings of ISP RAS, 32:4 (2020), 165–174
Citation in format AMSBIB
\Bibitem{SkoLasTur20}
\by K.~A.~Skorniakov, A.~S.~Laskina, D.~Yu.~Turdakov
\paper Two step method for grouping news with similar topics
\jour Proceedings of ISP RAS
\yr 2020
\vol 32
\issue 4
\pages 165--174
\mathnet{http://mi.mathnet.ru/tisp532}
\crossref{https://doi.org/10.15514/ISPRAS-2020-32(4)-12}
Linking options:
  • https://www.mathnet.ru/eng/tisp532
  • https://www.mathnet.ru/eng/tisp/v32/i4/p165
  • This publication is cited in the following 1 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Proceedings of the Institute for System Programming of the RAS
    Statistics & downloads:
    Abstract page:170
    Full-text PDF :100
    References:36
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2025