Data mining techniques can be used to automatically discover and extract information from Web documents, Web logs, and services. A weblog is a Web site that consists of a series of entries arranged in reverse chronological order, often updated frequently with new information about particular topics.
As a subfield of data mining, web mining can broadly be seen as the application of adapted data mining techniques to the web, whereas data mining is defined as the application of algorithms to extract useful knowledge from large amounts of data stored in databases, data warehouses, or other information repositories.
Web mining aims to discover patterns in web information by grouping and analyzing data to gain important insights. Large data sets serve as the input to web mining. Web data includes information, documents, structure, and profiles.
The web has multiple aspects that yield different approaches to the mining process: web pages consist of text, web pages are linked via hyperlinks, and user activity can be monitored via web server logs. These three aspects correspond to web content mining, web structure mining, and web usage mining, respectively.
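The three kinds of raw material named above (page text, hyperlinks, and server log entries) can be illustrated with a minimal Python sketch. It uses only the standard library. The sample page, the sample log line, and the names `PageParser`, `parse_page`, and `parse_log_line` are assumptions made up for illustration, and the log pattern assumes a line in Common Log Format; real servers often use other formats.

```python
import re
from html.parser import HTMLParser


class PageParser(HTMLParser):
    """Collects visible text (content) and href targets (structure)."""

    def __init__(self):
        super().__init__()
        self.text_parts = []
        self.links = []

    def handle_starttag(self, tag, attrs):
        # Hyperlinks are the raw material for structure mining.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

    def handle_data(self, data):
        # Text nodes are the raw material for content mining.
        stripped = data.strip()
        if stripped:
            self.text_parts.append(stripped)


# Hypothetical page, invented for this example.
SAMPLE_HTML = (
    '<html><body><p>Web mining intro</p>'
    '<a href="/usage.html">Usage</a></body></html>'
)

# Hypothetical server log line, assumed to be in Common Log Format.
SAMPLE_LOG = (
    '192.0.2.1 - - [10/Oct/2023:13:55:36 +0000] '
    '"GET /usage.html HTTP/1.1" 200 2326'
)

LOG_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" (?P<status>\d{3}) (?P<size>\S+)'
)


def parse_page(html):
    """Return (text fragments, link targets) found in an HTML string."""
    parser = PageParser()
    parser.feed(html)
    parser.close()
    return parser.text_parts, parser.links


def parse_log_line(line):
    """Return the fields of one log line as a dict, or None if it does not match."""
    match = LOG_PATTERN.match(line)
    return match.groupdict() if match else None
```

Calling `parse_page(SAMPLE_HTML)` yields the text fragments and link targets of the page, and `parse_log_line(SAMPLE_LOG)` yields the requesting host, path, and status of one request. Each result would feed a different kind of mining.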
Web mining can be defined from two viewpoints: process-based and data-driven. In the data-driven view, web data is the source from which knowledge is extracted. In general, web mining involves several steps: collecting data, selecting the data before processing, knowledge discovery, and analysis.
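The steps listed above can be sketched as a small pipeline. This is an illustrative outline only, assuming web usage data in the form of hypothetical `(host, path, status)` records; the function names `select`, `discover`, and `analyze` and the sample records are invented for this example and do not come from any particular tool.

```python
from collections import Counter

# Step 1, collecting data: hypothetical (host, path, status) records,
# standing in for entries gathered from a web server log.
RAW_RECORDS = [
    ("192.0.2.1", "/index.html", 200),
    ("192.0.2.1", "/usage.html", 200),
    ("192.0.2.2", "/index.html", 200),
    ("192.0.2.2", "/missing.html", 404),
    ("192.0.2.3", "/logo.png", 200),
    ("192.0.2.3", "/index.html", 200),
]


def select(records):
    """Step 2: keep only successful requests for HTML pages."""
    return [r for r in records if r[2] == 200 and r[1].endswith(".html")]


def discover(records):
    """Step 3: count how many distinct hosts visited each page."""
    seen = {(host, path) for host, path, _ in records}
    return Counter(path for _, path in seen)


def analyze(page_counts, top_n=1):
    """Step 4: rank pages by their number of distinct visitors."""
    return page_counts.most_common(top_n)


if __name__ == "__main__":
    selected = select(RAW_RECORDS)
    counts = discover(selected)
    print(analyze(counts))
```

Counting distinct visitors per page is only one simple choice for the discovery step. Association rules, clustering, or sequence analysis would slot into the same place in the pipeline.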
Weblog mining
Pilsner: The Quintessential Light Lager
-
Pilsner, a pale and crisp lager beer, originated in the city of Pilsen
(Plzeň), in what is now the Czech Republic, during the mid-19th century.
Its creatio...