Just a thought: free distributed search?


Every once in a while, I just get a hare-brained notion. Today’s was: why do we use a central website for doing internet searches at all? Why Google?

Consider the success of SETI@home, the distributed SETI project that the Planetary Society helped sponsor, and the distributed computing architecture that resulted from it. Consider the success of swarming download technology like BitTorrent. Consider how simple a basic web spider could be. Consider the efficiency of spidering networks locally. Consider the architecture of DNS.

See a pattern?

What if we replaced the concept of a search engine site with a search engine protocol? What if we ran small spidering operations on thousands of sites around the world instead of putting a massively parallel supercomputer in one room somewhere to do it? The individual spiders would be intelligent applications that learned their immediate environment, and then shared that data with others. Each person using the software could send queries into it, and it would propagate up through a series of spiders to find the best sources of information on the subject.
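To make the idea a bit more concrete, here is a minimal sketch of how a query might hop from one spider to its neighbors. Everything in it is an assumption made for illustration: the `Spider` class, the `crawl` and `query` methods, the hop limit `ttl`, and the example URLs are all hypothetical, and the "crawl" step just indexes text handed to it rather than fetching anything from the network.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Spider:
    """A hypothetical peer that indexes only its own corner of the web."""
    name: str
    index: dict[str, set[str]] = field(default_factory=dict)  # term -> URLs
    neighbors: list[Spider] = field(default_factory=list)

    def crawl(self, url: str, text: str) -> None:
        # Stand-in for real spidering: index the words of a page we were given.
        for term in text.lower().split():
            self.index.setdefault(term, set()).add(url)

    def query(self, term: str, ttl: int = 3, seen: set[str] | None = None) -> set[str]:
        # Answer from the local index, then forward to neighbors
        # until the hop limit (ttl) runs out.
        seen = set() if seen is None else seen
        if self.name in seen:  # do not ask the same peer twice
            return set()
        seen.add(self.name)
        results = set(self.index.get(term.lower(), set()))
        if ttl > 0:
            for peer in self.neighbors:
                results |= peer.query(term, ttl - 1, seen)
        return results


# Three peers in a chain: a -> b -> c
a, b, c = Spider("a"), Spider("b"), Spider("c")
a.neighbors.append(b)
b.neighbors.append(c)
b.crawl("http://example.org/gnu", "free software search")
c.crawl("http://example.net/p2p", "distributed search protocol")

hits = a.query("search")               # reaches both b and c
nearby = a.query("search", ttl=1)      # stops after one hop, so only b answers
```

The hop limit is one simple way to keep a query from flooding the whole network; a real protocol would also need ranking, caching, and some way to decide which neighbors are worth asking.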

Probably, you’d still need central indexes somewhere. But what if the index servers were run by lots of people, and not just one company?

It would be a whole new architecture, of course, and there are probably some weaknesses to it. But the idea of a peer-to-peer search network, with peer applications sharing both the indexing and querying load with each other, does seem feasible. After all, distributed computing is able to capture more computing power more cost-effectively than just about any supercomputer architecture, so the power to do it is probably there.

Makes me wonder if someone is already building it.

Just a thought…

Copyright information

This entry is (C) Copyright by its author, 2004-2008. Unless a different license is specified in the entry's body, the following license applies: "Verbatim copying and distribution of this entire article is permitted in any medium without royalty provided this notice is preserved and appropriate attribution information (author, original site, original URL) is included".

Biography

Terry Hancock: Terry Hancock is co-owner and technical officer of Anansi Spaceworks, dedicated to the application of free software methods to the development of space.

Reminds me in a strange way of geocaching

Submitted by Alan Berg on Mon, 2006-06-26 11:45.

http://www.geocaching.com/faq/

Yep, don't know why, but it's sort of like searching for gold.

network slowdown

Submitted by shtylman on Mon, 2006-06-26 18:02.

The idea may sound good, but I think you will run into a network slowdown. If every computer is trying to search the network for the same things, the work will be inherently redundant. I think the central index idea is better. Just think: if your network were being crawled at all times by various spiders looking for content, imagine the security concerns. Distributed things are good, but they do have their tradeoffs.

Reprint Notice

Submitted by Terry Hancock on Sat, 2006-07-22 00:08.

My blog entries at Free Software Magazine may be reprinted with this notice:
Copyright (C)2004-2006 Terry Hancock / License CC-By-SA 2.5+
http://creativecommons.org/licenses/by-sa/2.5
Originally at http://www.FreeSoftwareMagazine.com

Majestic-12

Submitted by Anonymous visitor on Sat, 2006-10-14 22:35.

They have already done it:

http://www.majestic12.co.uk/

I love it when that happens

Submitted by Terry Hancock on Wed, 2006-10-18 00:23.

Given that this thread is way off the front page, I may have to blog that site separately.

Thanks!

Distrib Search

Submitted by Anonymous visitor on Mon, 2007-02-05 20:15.

You'd need a large existing user base to pull this off. IMO, adding this as a component to Skype or, even better, Firefox would be a great way to go. Yes, there would be some security concerns... but how big would they be? I think that if you have 10 PCs in a cloud index the same item, and 9 give one answer and 1 gives another, you toss the odd data out.

Martin Tibbitts
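
The 9-against-1 rule in the comment above amounts to a majority vote over what the peers report. A rough sketch, assuming each peer returns a comparable value such as a content hash (the `consensus` function, the `quorum` threshold, and the `hash-...` strings are hypothetical, made up for this illustration):

```python
from collections import Counter


def consensus(answers, quorum=0.5):
    """Return the answer given by more than `quorum` of the peers, else None."""
    if not answers:
        return None
    value, count = Counter(answers).most_common(1)[0]
    # Accept the most common answer only if a clear majority agrees on it.
    return value if count / len(answers) > quorum else None


# Ten peers index the same item; nine agree and one disagrees.
reports = ["hash-abc"] * 9 + ["hash-xyz"]
agreed = consensus(reports)          # the odd answer is tossed out
split = consensus(["hash-abc", "hash-xyz"])  # no majority, so no answer
```

This only helps while dishonest or broken peers are a minority of those asked; if one party can run most of the peers that answer, the vote alone would not catch it.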