Search Engine

Code: Web Crawler (RAR) | PageRank (RAR) | String Matching (RAR)

A web crawler, PageRank and string matching are the three most important parts of a search engine. I implemented them separately. You can download the code from the links above. Each archive includes a ReadMe file, and the code is commented in detail.
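
As a rough illustration of what the PageRank part computes, here is a minimal power-iteration sketch in Java. It is not the code from the archive above: the class name PageRankSketch, the adjacency-array input, the damping factor passed by the caller and the fixed iteration count are all assumptions made for this example.

```java
import java.util.Arrays;

// Hypothetical illustration only; not the implementation in the PageRank archive.
class PageRankSketch {

    // outLinks[i] lists the indices of the pages that page i links to.
    static double[] rank(int[][] outLinks, double damping, int iterations) {
        int n = outLinks.length;
        double[] rank = new double[n];
        Arrays.fill(rank, 1.0 / n); // start from a uniform distribution

        for (int it = 0; it < iterations; it++) {
            double[] next = new double[n];
            double dangling = 0.0; // rank held by pages with no outgoing links

            for (int i = 0; i < n; i++) {
                if (outLinks[i].length == 0) {
                    dangling += rank[i];
                } else {
                    // Split page i's rank evenly among the pages it links to.
                    double share = rank[i] / outLinks[i].length;
                    for (int j : outLinks[i]) {
                        next[j] += share;
                    }
                }
            }

            // Apply damping and spread the dangling rank uniformly over all pages.
            for (int j = 0; j < n; j++) {
                next[j] = (1.0 - damping) / n + damping * (next[j] + dangling / n);
            }
            rank = next;
        }
        return rank;
    }
}
```

For example, `PageRankSketch.rank(new int[][] {{2}, {2}, {0}}, 0.85, 50)` ranks page 2 highest, because both other pages link to it. Each iteration of this sketch visits every page and every link once, so its cost per iteration is linear in the size of the link graph.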

The following are the evaluations of my implementations by www.become.com.

Web Crawler: Good implementation of crawler, robots.txt, crawl policies etc. You created data structures where simple collection classes from Java would have sufficed.
PageRank: There was an issue with Memory leak. Good use of SEL. Calculation is O(n).
String Matching: Good implementation of suffix tree. Efficient.
Overall you did very well!