论文部分内容阅读
中国电视网络传播监测与研究系统的信息源来自互联网上主流的门户类、传媒类、社区类、博客类、论坛类、微博类、社交类、视频类等网站,根据不同监测源的特点,采用定制化搜索与垂直搜索相结合的技术,通过自建服务器集群和云计算中心相结合的信息抓取平台,对于海量多元数据进行抓取。数据采集时间:2016.7.4-2016.7.31指标解释:视频点击量:指的是某一电视频道非UGC视频在互联网上被点击收看的次数。通过考察网民在互联网上收看视频节目的频次,用以反映网民的主动收看行为,彰显电视原创视频的二次传播效果。
The information sources of China TV network monitoring and research system come from websites such as mainstream portals, media, communities, blogs, forums, weibos, socials and videos on the Internet. According to the characteristics of different monitoring sources, Adopting a combination of customized search and vertical search technology, a platform for crawling information by combining a self-built server cluster and a cloud computing center is adopted to crawl massive multivariate data. Data Collection Time: 2016.7.4-2016.7.31 Explanation of Indicators: Video Hits: refers to the number of times that non-UGC videos of a certain television channel are clicked on the Internet. By examining the frequency of watching video programs on the Internet by netizens to reflect the active viewing behavior of Internet users, the secondary broadcast effect of original television videos is demonstrated.