


> Web Robots / Crawlers / Spiders Etc., All you need to know about them...
finaldesign
post Dec 12 2005, 11:34 AM
Post #1


[+] Graphic Designer [+]

Group: Members
Posts: 614
Joined: 6-April 05
From: Croatia
Member No.: 3,666



Hello!

While browsing the net today I found a great resource on web robots, search engines, web crawlers, spiders and the like. The page is: The web robots

You can find a lot of useful stuff there if you are a webmaster, or if you are interested in making your own spider/indexer or something similar. You can even look through their database of web robots; some of them come with their own source code, so you can compile one for yourself...
szupie
post Dec 12 2005, 01:06 PM
Post #2


S.P.A.M.S.W.A.T.

Group: Members
Posts: 814
Joined: 22-January 05
From: San Antonio, Texas (No, I'm not dumb. I just moved here...)
Member No.: 2,284



Where would you put your robot once you have it? I can't find that information on the site. Would you put it on a server (like astahost) or on your own computer? And does indexing take a lot of space and bandwidth?
finaldesign
post Dec 13 2005, 08:11 AM
Post #3


[+] Graphic Designer [+]

Group: Members
Posts: 614
Joined: 6-April 05
From: Croatia
Member No.: 3,666



I think it takes a lot of space... and consumes bandwidth and a lot of processor time...
szupie
post Dec 13 2005, 10:44 PM
Post #4


S.P.A.M.S.W.A.T.

Group: Members
Posts: 814
Joined: 22-January 05
From: San Antonio, Texas (No, I'm not dumb. I just moved here...)
Member No.: 2,284



Oh. Well, then, I'd better just have Google index my site for me, instead of wearing out Astahost's servers. Nice link, though.
finaldesign
post Dec 15 2005, 11:38 AM
Post #5


[+] Graphic Designer [+]

Group: Members
Posts: 614
Joined: 6-April 05
From: Croatia
Member No.: 3,666



Anyway, many of those web robots are capable of catching email addresses and harvesting them into a database... spammers use that very often to get a targeted audience. I figured that if we research some of these methods, maybe we could better protect ourselves from spam and junk email... Anyway, I'll post what I discover here later.
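
To make the harvesting idea concrete, here is a minimal Python sketch of how a naive harvester might pull addresses out of page source, and how one common countermeasure (writing the address as HTML character entities) hides it from that naive approach. The regular expression is deliberately simplified, and the names `harvest`, `obfuscate` and the example.com address are made up for illustration; real harvesters vary and may be smarter than this.

```python
import html
import re

# Simplified address pattern, for illustration only; real addresses can be more complex.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")

def harvest(page_source):
    """Return every address-like string found in the raw page source."""
    return EMAIL_RE.findall(page_source)

def obfuscate(address):
    """Encode each character as a decimal HTML entity; browsers still render it normally."""
    return "".join(f"&#{ord(ch)};" for ch in address)

# Hypothetical pages: one with a plain address, one with the entity-encoded form.
plain_page = '<p>Contact: <a href="mailto:webmaster@example.com">webmaster@example.com</a></p>'
hidden_page = f"<p>Contact: {obfuscate('webmaster@example.com')}</p>"

print(harvest(plain_page))                   # finds the address in both the href and the link text
print(harvest(hidden_page))                  # the raw source contains no literal '@', so nothing matches
print(harvest(html.unescape(hidden_page)))   # a harvester that decodes entities first still finds it
```

As the last line shows, entity encoding only stops harvesters that scan the raw source, so it reduces exposure rather than removing it.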
sagaxx
post Dec 26 2005, 04:05 PM
Post #6


Member [ Level 1 ]

Group: Members
Posts: 33
Joined: 25-December 05
From: Bucharest
Member No.: 10,286



Google's robots are the best robots, and it has something like hundreds of them. Running robots uses a lot of bandwidth, so it isn't good for little sites; they are good for sites like Google, MSN, etc.
finaldesign
post Dec 27 2005, 12:47 PM
Post #7


[+] Graphic Designer [+]

Group: Members
Posts: 614
Joined: 6-April 05
From: Croatia
Member No.: 3,666



QUOTE(sagaxx @ Dec 26 2005, 06:05 PM)
Google's robots are the best robots, and it has something like hundreds of them. Running robots uses a lot of bandwidth, so it isn't good for little sites; they are good for sites like Google, MSN, etc.
*


Well, the idea of this is testing and learning. Anyway, if you know how many of the spiders work, you will be able to make your web pages better and that way increase your page rank on search engines - and that's what we all want.
YudzzY
post Dec 27 2005, 01:28 PM
Post #8


Member - Active Contributor

Group: Members
Posts: 80
Joined: 5-September 05
Member No.: 8,327



The link is good, but not of much use to me right now. I will try to see how to put it to use in the future..
For the time being I will let Google do the work for me ;-)
Khymnon
post Jan 3 2006, 07:33 PM
Post #9


Member [ Level 2 ]

Group: Members
Posts: 72
Joined: 1-January 06
From: Egypt
Member No.: 10,410



I think that, with the current state of search engine optimization, we should let the SEs do the indexing themselves. Unless one knows exactly how every single SE works to index and rank their site, one will most likely hurt his ranking at one engine or another.

I remember one time when Google used H1 and TITLE tags as primary criteria for their ranking, while MSN had them at 4th and 6th. And right now, Google mainly uses a system called Vector Analysis, where they analyse the overall theme of your website and adjust your ranking accordingly. It's still a work in progress, but Google partly uses it, while not many others do.

So my point is, until SEs can reach a certain level of standardization, we should let each SE do what it likes most. Plus, it doesn't really take that much bandwidth, not more than any hungry visitor to your site would take.
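
If crawler bandwidth is still a worry, a site's robots.txt file is the usual place to tell robots which paths to skip. Below is a small Python sketch using the standard library's `urllib.robotparser` to show how a well-behaved robot would read such a file. The robots.txt content, the `ExampleBot` name and the URLs are invented for illustration, and `Crawl-delay` is a non-standard directive that only some crawlers honour.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; "ExampleBot" is a made-up crawler name.
robots_txt = """\
User-agent: ExampleBot
Crawl-delay: 10
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(robots_txt.splitlines())  # parse from memory, no network access needed

# A polite robot asks before fetching each URL.
blocked = parser.can_fetch("ExampleBot", "http://example.com/private/page.html")
allowed = parser.can_fetch("ExampleBot", "http://example.com/index.html")
delay = parser.crawl_delay("ExampleBot")  # seconds to wait between requests, if given

print(blocked, allowed, delay)
```

Note that this only works for robots that choose to obey the file; nothing in robots.txt can force a misbehaving robot (such as an email harvester) to stay away.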
szupie
post Jan 3 2006, 10:18 PM
Post #10


S.P.A.M.S.W.A.T.

Group: Members
Posts: 814
Joined: 22-January 05
From: San Antonio, Texas (No, I'm not dumb. I just moved here...)
Member No.: 2,284



QUOTE(Khymnon @ Jan 3 2006, 01:33 PM)
I remember one time when Google used H1 and TITLE tags as primary criteria for their ranking, while MSN had them at 4th and 6th. And right now, Google mainly uses a system called Vector Analysis, where they analyse the overall theme of your website and adjust your ranking accordingly. It's still a work in progress, but Google partly uses it, while not many others do.
*



When you say "theme", do you mean the content theme or the visual theme of the site? And how can the robot adjust the ranking with the H1, title or theme? I don't get how a computer can tell what is good and what is bad.
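
One way to picture how a robot could use H1 and TITLE tags: it cannot judge "good" or "bad", but it can count words and give extra weight to words that appear inside certain tags. The Python sketch below is a toy illustration of that idea only. The weights (5 for `title`, 3 for `h1`, 1 elsewhere), the class name `WeightedTermCounter` and the sample page are all assumptions made up for this example, not how Google or any real search engine ranks pages.

```python
from collections import Counter
from html.parser import HTMLParser

# Hypothetical weights, chosen only for illustration.
TAG_WEIGHTS = {"title": 5, "h1": 3}
DEFAULT_WEIGHT = 1

class WeightedTermCounter(HTMLParser):
    """Counts words on a page, weighting words inside <title> and <h1> more heavily."""

    def __init__(self):
        super().__init__()
        self.active = []          # weighted tags that are currently open
        self.scores = Counter()   # word -> accumulated weight

    def handle_starttag(self, tag, attrs):
        if tag in TAG_WEIGHTS:
            self.active.append(tag)

    def handle_endtag(self, tag):
        if tag in self.active:
            self.active.remove(tag)

    def handle_data(self, data):
        # Use the highest weight among the open weighted tags, else the default.
        weight = max((TAG_WEIGHTS[t] for t in self.active), default=DEFAULT_WEIGHT)
        for raw in data.lower().split():
            word = raw.strip(".,!?")
            if word:
                self.scores[word] += weight

# A made-up sample page.
page = """<html><head><title>Web Robots Guide</title></head>
<body><h1>Robots and spiders</h1>
<p>Spiders crawl pages. Robots follow links.</p></body></html>"""

counter = WeightedTermCounter()
counter.feed(page)
print(counter.scores.most_common(3))  # "robots" scores highest: it is in the title, the h1 and the body
```

A search for "robots" would then favour this page over one that only mentions the word once in a paragraph, which is roughly what "using H1 and TITLE as ranking criteria" means in practice.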
