Cloud Computing for Big Data Processing

Chinese WeChat and Blog Hot Words Detection Method Based on Chinese Semantic Clustering

Pages 613-618 | Published online: 05 Oct 2017
 

Abstract

This paper proposes a hot topic detection method based on Chinese semantic clustering. The method targets the high dimensionality and information fragmentation of Chinese WeChat data. To analyze the sparse, fragmented content of Chinese WeChat and blog data, we combine multiple strategies (repeated string computation, context adjacency analysis, and linguistic rule filtering) to extract meaningful sentences that express independent and complete semantics. We then model the Chinese WeChat data in a relatively small and meaningful string space, generate candidate topics via feature clustering, and pick the hot topics by sorting the candidates according to heat. Experimental results on WeChat and blog data show that the method reduces the dimensionality of the high-dimensional sparse blog space to some extent and is effective and feasible for WeChat hot topic detection.
