Seol mar théacs é seo: An Improved Data Clustering Algorithm for Mining Web Documents