Grub
href="http://www.grub.org/">Grub’s Distributed Web Crawling
Project: Much like the famous
href="http://setiathome.ssl.berkeley.edu/">Seti@home, Grub is a
distributed web crawler that aims to crawl the entire web daily!
Grub uses the power of distributed computing to build
the best search on the Web. It automatically crawls the Web in the
background, borrowing your computer’s spare clock cycles, so you
won’t even notice it’s there. The download is quick, you control
how much you crawl, and the cool screensaver shows you the
real-time progress your computer is making. You can even compare
your stats to other Grubsters in the project!
And it’s Open Source:
Open-Source is a great way to get a large, diverse
group of people working on software, and at the same time make sure
that it is secure and bug free. Security and quality are top
priority for us - we don’t want anyone’s computer compromised
because we missed something in the coding phase. We will make all
software written during the project Open Source as long as there
were external contributions to those portions of code. For the time
being however, we have chosen to fork and close source the *server*
portion of the software, due to security issues related to the
quality of URLs submitted to the system. We may choose to reopen
that source depending on the level of contribution to the project
by outside developers. Please keep in mind that the server code was
written ENTIRELY by us, and uses no GPL’d code in its current
state.
Also covered at
href="http://www.wired.com/news/infostructure/0,1377,58497-2,00.html">
Wired and
href="http://news.com.com/2100-1032-993591.html">CNet.