Topic > Web Usage Mining - 1146

Many people interact with websites around the world every day. A huge amount of data is generated and this information could be highly respected by the company in the field of accepting Customer behaviors. Web usage extraction is a relatively independent, but not isolated, category that primarily describes techniques that discover the user's usage pattern and attempt to predict their behaviors. Web Usage Mining is the area of ​​data mining that deals with the novelty and study of usage patterns with the use of web log data. Especially web logs to advance web-based applications. User identification is used to identify who accesses the website and which pages are accessed. If users are logged in with their information, it is easy to identify users. Indeed, there are masses of users who do not register their information. In fact, there are a large number of users accessing websites through agents, numerous users use the same computer, there is a firewall, independent users use different browsers, and so on. All the difficulties make this work extremely complicated and very hard, to accurately identify each unique user. We may use cookies to track user behavior. But considering someone's privacy, many users do not use cookies, so you need to find other methods to solve this problem. For users who use the similar computer or use the similar agent, how to find them? As presented in [9], it uses the heuristic method to solve the problem, if a page is requested that is not directly reachable via a hyperlink with some of In the pages visited by the user, the experiential assumes that there is another user with the same computer or with the same IP address. Doru Tanasa and Brigitte Trousse [4] present a method called navigation......middle of paper......consideration from the web server. Designed to allow companies to use cookies to learn the behavior of visitors online. However, check the convenience of methods to control the currency of cookies on your computer, these are often limited by users. USER IDENTIFICATION USING THE REFERRAL LOG The method used here is this. The REFERER_URL parameter collected with the access log and site topology are used to conceptualize the navigation traces for each user viewed (Cooley et al. 1999). If a new page appears after the set of pages that is not accessible from the previously viewed pages, a new user is brought forward. A further condition for which a new user is expected is when in a path of pages already viewed there appears to be a page already navigated. This situation is very limited and not precise. It does not receive repeated pages in the same user in the same session, which is very public in real life.