AWStats is a server log processing and analyzer program. It can generate text files with several types of statistics computed from the values of the processed logs.
This class can be used to parse AWStats files and extract several types of statistics values.
The extracted statistics values are stored in class array variables. It can extract statistics about visits and visitors, referers, ranges, sections, etc..
This class can be used to extract important words from HTML documents.
It can process a well-formed XHTML document and extract the words contained in the document.
The class gives scores to each word depending on conditions like, whether the first letter is upper case, whether the word is inside strong or bold tags, etc..
It returns an associative array of words sorted by importance score.
This class can parse and analyze HTML documents with multi-byte characters.
It uses the DOM document and php_mbstring extension to parse multi-byte HTML documents.
The class can also analyze the parsed documents and return the document encoding, DOCTYPE, base URL, meta tags, title, links, images, document text, words in tags h1, title, a, a title, img alt, words weights and density.