问题描述:
看看下面的英文有没有错误?
With the rapid development of the Internet,the Internet has become an important way to access to news and information.How to be more convenient,more comprehensive and more accurate to access to relevant information has become an issue.Traditional network media with single web site is difficult to satisfy the needs of users,the news search engine emerged.With the popularity of mobile phones and the improvement of its convenience,mobile news search has become a trend.
In this paper,a number of key technologies of the Mobile Chinese news search engine have been deeply analysised and researched,and a prototype system has been realized.The study includes the following main points:
1) Design and implementation of a news HTML pages text extraction algorithm based on the characteristics of human vision.The algorithm is based on the judgment of text as people.According to the factors incluing the number of Chinese characters,the number of hot words,the number of hyperlinks,a certain paragraphs of text can be determined,then by using the relationship of HTML nodes to determine the text of the news.Experiments indicate that with this method,text of the news pages can be accurately extracted and advertising and the other unrelevant parts can be well removed.Unlike traditional extraction method,not only unnecessary to adjust configuration according to different website and different channels,but also without pre-learning.
2) Design of mobile Chinese news search engine system,the concrete realization of the program,achieving a system prototype and made a number of improvements to the users’ experience as the next phase of work.
With the rapid development of the Internet,the Internet has become an important way to access to news and information.How to be more convenient,more comprehensive and more accurate to access to relevant information has become an issue.Traditional network media with single web site is difficult to satisfy the needs of users,the news search engine emerged.With the popularity of mobile phones and the improvement of its convenience,mobile news search has become a trend.
In this paper,a number of key technologies of the Mobile Chinese news search engine have been deeply analysised and researched,and a prototype system has been realized.The study includes the following main points:
1) Design and implementation of a news HTML pages text extraction algorithm based on the characteristics of human vision.The algorithm is based on the judgment of text as people.According to the factors incluing the number of Chinese characters,the number of hot words,the number of hyperlinks,a certain paragraphs of text can be determined,then by using the relationship of HTML nodes to determine the text of the news.Experiments indicate that with this method,text of the news pages can be accurately extracted and advertising and the other unrelevant parts can be well removed.Unlike traditional extraction method,not only unnecessary to adjust configuration according to different website and different channels,but also without pre-learning.
2) Design of mobile Chinese news search engine system,the concrete realization of the program,achieving a system prototype and made a number of improvements to the users’ experience as the next phase of work.
问题解答:
我来补答展开全文阅读