In computer science, “Data Mining” can be referred to as knowledge mining from data, knowledge extraction, data/pattern analysis, data archaeology, and data dredging. Web mining aims to discover and retrieve useful and interesting patterns from large data sets and classic data mining.
Web mining methods are divided into three categories: web content mining, web structure mining and web usage mining.
The extraction of certain information from the unstructured raw data text of unknown structures is referred to as Web content mining.
It is the process of extraction of information from the content of the web documents. Web content consist of several types of data – text, image, audio, video etc. Content data is the group of facts that a web page is designed. It can provide effective and interesting patterns about user needs.
Web data are generally semi-structured and/or unstructured, while data mining is primarily concerned with structured data. With a large amount of data that is available on the World Wide Web, content mining supports the results lists to search engines in order of largest applicability to the keywords in the query.
Web Content Mining Techniques:
*Pre-processing
*Clustering
*Classifying
*Identifying the associations
*Topic identification, tracking, and drift analysis
Web content mining
Enron: Rise, Scandal, and the Legacy of Corporate Greed
-
Enron Corporation, once a giant in the energy industry, rose to prominence
through innovative strategies and rapid expansion, only to collapse under
the we...