Data mining techniques can be used to automatically discover and extract information from Web documents, Web logs, and services. A weblog is a Web site that consists of a series of entries arranged in reverse chronological order, often updated on frequently with new information about particular topics.
As a subfield of data mining, web mining can widely be seen as the application of adapted data mining techniques to the web, whereas data mining is defined as the application of the algorithm to extract or mining of useful knowledge from large amounts of data stored in database, data warehouse, or other information repositories.
Web mining aims to discover the designs in web information by grouping and analyzing data to receive important insights. Big data act as data sets on web mining. Web data includes information, documents, structure and profile.
The web has multiple aspects that yield different approaches for the mining process, such as web pages consist of text, web pages are linked via hyperlinks, and user activity can be monitored via web server logs.
Web mining is based on two concepts defined, process-based and data-driven. In the view of Web mining data web is used to extract knowledge. In general, the use of web mining typically involves several steps: collecting data, selecting the data before processing, knowledge discovery and analysis.
Weblog mining
Enron: Rise, Scandal, and the Legacy of Corporate Greed
-
Enron Corporation, once a giant in the energy industry, rose to prominence
through innovative strategies and rapid expansion, only to collapse under
the we...