">
Domain Registration : Discount Domain Pricing : Website and Domain Hosting : Frontpage 2000 Hosting : Website Development and Promotion : Secure E-Commerce : Shopping Cart Software : Linux and Windows NT : Graphic Design : Free Email Services : Fast Servers : Low Cost!


RESOURCES




First, I'm not going to bore you with my life story. I'm sure you could
care less to hear it, and maybe your's is much more interesting thanmine.
However, as far as search engines are concerned, our research and
development team has spent the last six months researching new wayson how
to advertise on the web. I must admit, it wasn't an easy task to break
through the barrier of hundreds of thousands or millions of entriesthat
would come up every time you search for a word or a phrase on Altavistaor
Infoseek. With odds like these it seemed easier at first to win thelottery
than getting a top 10 listing. Then we thought that those people whoalways
come up first must be paying a lot of money for it, like a TV commercial
... the more you pay the more your site gets listed. Even though, some
search engines do charge a fee to get their customers ahead of everyone
else, with a cost of about two thousand dollars and up, by sellingkeywords
through ads. But, this is not the case here. I really do not recommendthat
you pay a penny for your listings. You can get all the hits you needfor
free, and the answer is, of course, what these techniques are all about.

Before we begin, let me assure you that there are no magic tricks to
getting your site atop search engines. The only tricks that I'm awareof
are the ones that are banned by search engines and fit the categoryof
spamming, which I would strongly advise you to avoid for your own good.
This matter will be discussed at the end of this report.

What we are here to talk about is proven methods and techniques whichwith
some effort on your part, will teach you how to get your site to thetop 10
- 20 listings in Search Engines and Directories. First, lets outlinethe
items that we will be talking about supported by variety of links toother
sites which will give you a good prospective on these subjects.

Advertising Your Business on The World Wide Web

Effective Presentation

How to Design your HTML Pages to Achieve a Top Listing

* What is HTML?
* Domain Names, Subsites and Pages (URLs)
* Titles and Descriptions
* How to build your page Titles and Subtitles for maximum results?
* Keywords: What are they and why do you need them?
* How to make a good keyword selection?
* Use of plurals in your keywords.
* Adding your competitors names and products.
* Is keyword repetition good or bad?
* What are Meta Tags and which Engines Support them?
* Relevancy Booster and Link Popularity.

Learning more about Search Engines and Directories

* Search Engines
* Differences between Search Engines and Directories.
* What makes a Search Engine better than the other?
* Is Search Engine size important to you?
* Full-text Search Engines.
* How do Robots and Spiders read and index URLs?
* Web Page Ranking.

Listing Your Pages

* Submitting your pages to Search Engines and Directories.
* How to EXCLUDE or REMOVE your pages from being listed by using:
"noindex" or "robots.txt"?
* Search Engine Freshness.
* Search Engines Chart
* Frequently Changing your Page.

Checking Your Site Status

* How to check your URL Status?
* How to measure your page's Link Popularity?

Spamming (the forbidden word)

* Spamming and Penalties "According to INFOSEEK"
* More on Spamming "origin of the word".

Final Test:
 
 

Now Let's Begin ...

1. Advertising Your Business on The World Wide Web

Advertising is always the ticket to promote your products and/or services.
Have you ever imagined how much it will cost you to advertise yourbusiness
on TV 24 hours a day, 7 days a week; Say millions of Dollars... That'san
understatement! A 30 second Prime Time TV commercial world wide willcost
over a million dollars and that is depending on the TV station andnumber
of viewers. Web advertising is a major step forward that gives a wholenew
meaning to business advertising. With a little effort and some minor
investment, you can transform your business from a local small timerto a
world wide company selling products on the web. This year, a good numberof
businesses have increased their sale on the Web from a few hundredthousand
a year to over a 100 million dollars and that is because of their
popularity. There are over 50 million Internet users today and dramatically
increasing every day. So, do you see my point? Web advertising is the21st
century way of doing business, and it's for everybody including you.The
problem is, how would you do it... Please read on.

2. Effective Presentation

The first thing you need to do is open your web site and look at itfrom a
customer's point of view; Do you like what you see? There are a numberof
questions that you have to ask yourself while designing your pages.

a. If you had a product for sale on your site, would you feel comfortable
to buy it?
b. How does it stand out among your competition?
c. How interesting is your site?
d. How long would you stay in your site if you visited it for the first
time?
e. Is your material simple to read?
f. Is it easy on the eyes to look at and browse through different topics?
g. Are there different points of interest other than your products?
h. Would you visit your site again?...

Remember, no matter how small your business is, on the Web you can lookas
big as you want to be! It all depends on your presentation.

Now, let's begin with your page layout. First, you need to make yourpages
easy and simple to read. Try to be easy on backgrounds, most peopledon't
like gimmicky backgrounds and too many animations... Don't over doit, keep
it simple, these matters do not only deviate people's attention, butit
also makes your pages load very slowly and that, by itself, is enoughto
turn people away. Keep your pages attractive and don't use too manycolors.
Display the highlights of your products or services up-front. Use the
curiosity factor - don't deny people their most precious gift.....
Curiosity! Offer some FREE information, or maybe some specials. Makeyour
site interesting for everybody. Let's say you're a Car Dealer. It doesn't
hurt to put some Driving Safety tips or some links to DMV locationsin your
area, or maybe links to Car Insurance Agencies. Also, it's always agood
idea to keep checking your competitors' sites and keep up with thechanges.
For more examples: check out our Corporate Web Site:
http://www.ecstec.com/. We know that it is not perfect, but it mightgive
you some ideas.

3. How to Design your HTML Pages to Achieve a Top Listing

* What is HTML

HyperText Markup Language is a programming language composed of a setof
elements that define a document.

Click on and bookmark the following links for a quick reference on HTML:
http://kuhttp.cc.ukans.edu/lynx_help/HTML_quick.html

The following Links will show you a list of HTML Codes that you mayfind
interesting:
http://union.ncsa.uiuc.edu/HyperNews/get/www/html/code-index.html

More links if you want to learn HTML:

http://www.fuqua.duke.edu/internal/AUTHORS/primer.htm

http://info.rutgers.edu/newark/web.tutoria

Software Download Link:

http://www.utoronto.ca/webdocs/HTMLdocs/intro_tools.html

* Domain Names, Subsites and Pages (URLs)

Many people often miss the value of their URL name, even though it is
difficult to find an unregistered descriptive Domain name such as:
www.forsale.com. However, you can always get more descriptive withyour
Subdomain name and pages. For example, if you sell Computer Memory,it will
be much better to make your URL like this:
www.yoursite.com/computers/memory.html instead of
www.yoursite.com/Johndoe/mypage.html. Names like these are easier to
advertise because they are self explanatory and can be categorizedbetter
by Search Engines. You will greatly increase your "findability" factoronce
you have narrowed it down to Computers and Memory. And at the sametime,
you would have gained two keywords that will increase your site's relevancy
factor. Also, it's easier to remember when people want to visit yoursite
again without having to search.

* Titles, Descriptions and Density Ratio.

Of course, a good name is not nearly enough to be found on the Web.Titles
and Descriptions are the most important factors in your advertising.

Page Title:

Page Title is the one that shows on the top frame of your Microsoft
Explorer or Netscape Browser and the first bold lettered line on Infoseek
or AltaVista's search results.

This is just an example:

USA Today--New device will turn PCs into virtual ATMs
NEW YORK - A group of companies on Tuesday announced they have developed
a new device that can turn almost any personal computer into an automatic
teller machine. Called MoneyClip, ...
26% http://www.usatoday.com/life/cyber/tech/cta219.htm (Size 3.8K)

* How to build your page Titles for maximum results?

Rule # 1. Make your page title clear and straight to the point. Nothingis
worse than a title that does not make sense. Make it concise, do notuse
irrelevant words and do not exceed 72 characters. Otherwise, your statement
is going to be chopped off from the end and look incomplete.

Titles can be listed in Meta Tags which are compatible only with some
Search Engines. Meta tag titles would look like the following:

<title>Memory Sale! Great Prices. Simms and Dimm Memory - FREE
Delivery!</title>

Rule #2. Use a different title for each one of your pages and sub-pagesand
always put some of your best keywords in them as shown in the example
below. Please note: The following two factors are considered the most
important ones that search engines and directories always watch forwhen
indexing a site or a page. The main purpose behind these factors isto
accurately categorize a page in which to be relevant to it's contents,so,
when a person searches for certain information on a subject, get the
correct results instead of a bunch of topics that have no substanceand
totaly irrelevant to their search. This matter by itself can make asearch
engine more popular than another. INFOSEEK is known to be the mostprecise
search engine when it comes to matching subjects with keywords. Also,and
for the same reason YAHOO directory has become the most used amongall
others and that is because of it's subject relevacy.

Factor #1. Most search engines list the Page Title first, and that is
because they consider it the key statement that tells most about apage.
"Read a book from it's own title." But that's not all. They also indexall
the keywords in it and in different combinations in order to give the
searcher more than one choice when looking for this information. Seethe
example below, under the word "Title".

Factor #2. In order for search engines to establish more relevancy toa
page, they do not stop by reading only it's tilte words or phrases;Some
dig in even deeper in the page to see how often these words or phrasesare
being repeated inside, then they will be indexed again according totheir
occurances. That is one of the main reasons, search engines forebid
"spamming" because some people try to cheat by listing their keywordstoo
many times hidden inside the page in one form or another in an attemptto
get more listings. Also, some would place too many words that are
irrelevant to their subject matter in the same manner, in order toget a
bigger audience to their site. There is no harm in putting a smallvariety
of keywords that might not directly relate to you subject in orderto
attract people who might be interested in your products - so long youdo
not over do it. More on this subject is explained below, under thetitles
"Keywords" and "Spamming".

Title:

For example: if I wanted to advertise Computer Memory on my site, mytitle
would look like this.

"Memory Sale! Great Prices. Simms and Dimm Memory - FREE Delivery!"

With a title like this you would have already covered a number of hot
keywords that people usually search for such as: memory sale, memorysimms,
memory dimm, memory, simms memory, dimm memory, memory sale+free delivery.
So you see how search engines index titles now. Sometimes they areeven
matched backward like: great prices+memory sale or great memory sale.Some
Search Engines consider your title and description the most relevantwhen
listing your site. They categorize your page according to its titleand
description just like in a library. Think about it for a second - ifyou go
to a public library to search for books pertaining to a certain subjecthow
would you do it? First, you will find the category of your subject- let's
say Computers. Then, you start reading the titles, then their descriptions
until you find what you need. Deep Search Engines do even more thanthat.
Their Robots or Spiders usually go deep inside a page and they readit all.
They go through all the links in an attempt to evaluate the site andto
establish an accurate assessment of it's relevancy. We will talk moreabout
this a little later.

Rule #3. Your page title is what people see first when searching theWeb.
It is perhaps the most important statement that you make to advertiseyour
site. That's what people click on to get to your site. It is the front
gate, the first impression that will either attract them to your siteor
turn them away. By making a good first impression you will make itthrough
the threshold, then we say, "Congratulations"! you got a new hit anda good
one for that matter. Use a highly descriptive title, make it appealingand
exciting. Use terms like: FREE, GUARANTEED, BEST OF A KIND etc... Your
title appeal will single you out among other listings. To get moreideas on
how to make a good title, do a search on Infoseek http://www.infoseek.com
or AltaVista http://www.altavista.com using statements that fit the
category of your business or product. Take some notes, then put yourtitle
together. Remember one thing though! Some people find it easier tocopy the
same Title off their competitors' page. In doing so you can createa slight
problem. Personally, if I find two sites with the same title, I willmost
likely skip one of them, usually the second one. If this is the case,you
will be taking a 50/50 chance of being skipped. Also, since your competitor
was there first, it is most likely that your title will follow the
original. Why be the scavenger who picks after others if you have the
capability to be the predator yourself? It is better instead, to readtheir
titles and try to make yours more appealing. However, in order to useyour
competitors' name, title and keywords, you need to use what is called
"Subtitles." We will talk more about this later.

Page Description:

Page description is the two lines that show under your Page Title when
searching on Infoseek or AltaVista. Following is an example:

"Brand New Memory Simms. All sizes, great prices Guaranteed! Order today
and we will deliver it for FREE anywhere in the U.S. or Canada. Limited
Time Offer."

Your page description is the next best thing after your title. People
almost always read the description before clicking on the link. Todesign
your description properly, you need to follow the same rules as described
above in your Page Title. Page descriptions get listed in many different
ways. The first way is in Meta Tags. Something like this:

<meta name="description"content="Brand New Memory Simms. All sizes>

 Today and we will deliver it for FREE anywhere in
the U.S. or Canada. Limited Time Offer.">

You can place a lot of good key statements in your Description whichwill
be listed in Search Engines. You'll be surprised how many different
combinations of keywords are taken from your description.

Density Ratio:

The density ratio is the percentage of words being matched in sequence
according to the words that are being searched for. This ratio is the
percentage % amount that is normally listed under the description lineson
Infoseek search and some other engines. For example: If a person is
searching for "search engines advertising" any match found that hasall
three words in the same order gets a 100% density ratio. all othermatches,
like "search engines" will get may be 80% or 85% matching ratio. However,
this issue is not the most important factor to get you a top listing.There
are number of other factors that are more important.

* Keywords: What are they and why do you need them.

Keywords are the means by which people can find your site. They becomeyour
ticket to be recognized. By using the proper keywords you can targetthe
type of searchers that you want to bring to your site, thereby improving
its productivity. For example: If you were looking for a car on theweb,
what would be the most likely keywords to use. I would say first wouldbe
the Car Make, then the Model. For instance, "Acura Legend" would bea very
specific search. But there are many candidates who might be lookingfor a
car bargain with disregard to the make, so you might also want to list:Car
Dealers, Car Bargains, Automobiles, Vehicles, etc.. Your keywords dictate
the type of visitors you get to your site. Spending enough time putting
your keywords together will always pay.

* How to make a good keyword selection?

1. This may be the most important part of all! A good way to find keywords
that fit your business category is by checking your competitors' sites.Do
a little research of your own. Go to Infoseek, for example:
http://www.infoseek.com type in any keyword that describes your productor
service, then look at the first 10 listings found. Then, on your browser's
menu bar click on View and then Source. Scroll down through the entireHTML
Page and pick all the keywords that are relevant to your business.Do this
as many times as you feel is necessary until you come up with a good
keyword list.

2. To enhance your findability factor, look for other products thatare in
the neighborhood of your actual line. For example: If your main lineis
Women's Clothing, it will be a good idea to include "perfume" or "make-up"
in your list then go check other sites using the same method above,by
doing this, you'll end up with a whole new set to add to your list.

3. To increase the relevancy of your page, you need to include METAtag
keywords and phrases separated by commas. Get all the keywords andphrases
that you put together as I mentioned earlier and sort them out in thebest
way that describes your site or business. Use plurals anywhere that
applies. For example: search engine and search engines. The reasonis to
use the word "engines" is because you'll actually be covering both"engine"
and "engines". I would use both words in any case, to get more thanone
listing for the word. Organized Meta tag keywords can boost your relevancy
as long as you don't do too many repetitions since the overuse of repeated
keywords can actually reduce your relevancy score, and you may evenget
penalized. For more explanation, read the chapter called Spamming.

* Adding your Competitors Names and Products.

We talked earlier about how to find keywords and phrases on you
competitors' pages, and we emphasized on keeping your Title and Description
unique. However, you can use some of your competitor's Keywords, Products,
Description or even their names in hidden subtitles. For example:

This is just an example. Use your own keywords instead.

<meta name="keywords"content="john doe, products, product, best quality
items, john doe, inc., etc..">
Also, you can put more keywords in the body of your page in the formof a
Comment Tag. Something like this:

<!--your keywords and/or phrases--> you can use a few of these tagsin your
page, but always remember not to over do it. After you finish, checkyour
page in a browser and make sure that these tags are not visible inyour
page.
 
 

* What are Meta Tags and which Engines Support them?

Only some search engines like Altavista and Infoseek support META tags.
Some search engines such as Lycos read the text placed within the tags,but
it has no control over whether it's a title or a description. Normally,
most search engines read the title of a page and first 2 - 3 linesof the
body as description, and some go in deeper and pick up the words thathave
been repeated more frequently.

Meta tags are useful tools to organize your pages to get a good listing
with some search engines. They can be especially helpful for pagesthat
have little text, which will give you the opportunity to make a proper
Title, Description and Keywords without having them visible on yourpage.
Also, instead of letting the spider that visits your site pick thetitle
and description on it's own discretion, it will give you the chanceto
organize them and get more accurate results. For example: Let's sayyour
page title is "Homes and Gardens" and the header says "Flower of theMonth"
followed by a picture. If you are not using tags some search engineswill
list it like this:

Homes and Gardens
Flower of the Month

Instead you can have tags that describe what you do and give you theimage
you are trying to project without altering your page contents. Also,you
can list some good keywords that makes it possible for people to findyou.
Example:

<HEAD>
<TITLE>Homes and Gardens</TITLE>
<META name="description" content="Find out how you can turn yourgarden
into virtual paradise.">
<META name="keywords" content="flowers, gardens, exotic bouquets,red
roses, gardening, plants">
</HEAD>

This way, your listing will read as follows on search engines that support
tags:

Homes and Gardens
Find out how you can turn your garden into a virtual paradise.

Meta tags are always good to have in a page. They don't guarantee youa top
10 listing, but they will give your site a relevancy boost with somesearch
engines. Meta tags can also help your site to get closer to the topif you
include a good keyword selection that people normally use when searching
for those words. Of course, there are many other tags that are usedfor
various reasons, like "robots" tag which is used to exclude a pagefrom
being indexed but the most important ones are those of your title,
description and keywords. Remember, it is a combination of things thatget
your site a top 10 listing and Meta tags is one of them.

The following is a list of helpful links that will teach you more about
META Tags:

A Dictionary of HTML META Tags
http://vancouver-webpages.com/META/

Dr Clue's HTML Guide - Meta Tags
http://users.abac.com/cgi-bin/drclue/F1.cgi/HTML/META/META.html

HTML Writers Guild meta tag page
http://www.hwg.org/faqs/metaFAQ.html

WDVL's META Tagging for Search Engines
http://www.stars.com/Search/Meta/Tag.html

Excite's Getting Listed page
http://www.excite.com/Info/listing.html

Meta Medic
http://www.northernwebs.com/set/setsimjr.html

Meta-Tag Generator
http://www.websitepromote.com/resources/meta/

Meta Tags Can Index, Organize Your Web Pages
PC Week, 1/97
http://www.pcweek.com/ir/0113/13jia.html

* Relevancy Booster and Link Popularity.

Search engines can determine how popular a page is by analyzing thelinks
that refer to it from other pages. Some engines go through all thelinks
and list those pages accordingly. This can create a problem sometimesif
you have a page that you don't want people to see unless they go through
another page, or maybe some restricted information that you only wantto
show under certain conditions. This is where you need to use "Robots"tag
for engines that supports it and "Robots.txt" for engines that don't.Some
engines also, evaluate these links and give a relevancy boost to apage
according to it's popularity.

* Frequently Changing your Pages.

Many search engines measure how often pages change. When they find asite
that changes often they will visit it more frequently. Those that don't
change often are visited less. It will be a good idea if you keep modifying
your page maybe every month or two. It's not only good for more frequent
listings but it will also keep your pages more interesting for peopleto
visit often.

Important: It is unlikely for a Robot or Spider to visit your site unless
you were listed in their database or have some links to your site on
someone else's page that is listed. You must submit your URLs through
normal search engine submission channels before you can hope to bevisited
by their robots/spiders.
 
 

4. Learning more about Search Engines and Directories

* Search Engines.

The most common way search engines update their databases, is by using
what's called "Spiders" or "Crawlers". The spider's job is to constantly
search the web for new pages and catalog them according to it's
pre-programmed queries. Spiders normally visit pages using the URLsthat
are already indexed in their database for the purpose of updating their
information so they remain fresh. After a spider visits a page, itfirst
reads the title, then the first 2-3 lines of the body as a description,or
if a search engine supports Meta tags, then it reads the title and
description tags to see if there were any changes in content. If changes
are found it then re-indexes that page in a new listing. Some engines,such
as "Deep Search Engines" usually read the entire page they visit andall
linked subpages, then they index all words and phrases that are relevantto
the title and description of the page. Of course, this varies witheach
search engine. Some engines consider the most frequently repeated wordsor
phrases as most relevant to the page and they're index with higher
relevancy than other words, and some consider the words that are closerto
the top of the page as most relevant or maybe a combination of these
mentioned factors. But the most common between all search engines isthat
they all follow links, and through these links they visit new pagesand do
the same routine again.

Please Note: The following paragraphs are collected from different sites
and research agencies and brought to you in their original format;followed
by their own copyright and a link to their owner's page. These paragraphs
are quoted as they are for the sole purpose of comparison in whichto give
our readers a wider prospective on this subject.

The following information is quoted from:

Internic Directory and Database Services

Link: http://ds2.internic.net/tools/web-search-text.html

" Alta Vista (Web search)
Alta Vista provides simple and advanced searches on the largest Webindex:
31 million pages found on 1,158,000 servers, and 4 million articlesfrom
14,000 Usenet news groups. Because of the extensiveness of its database,
Alta Vista excels at finding obscure bits of information virtuallyanywhere
on the web.

Excite
Excite tracks down information by searching for concepts, not justby
keywords using a search technology called Intelligent Concept Extraction.
It claims to search the full text of 50 million web pages, 60,000
categorized Web site reviews, and thousands of recent Usenet postings.
Excite is among the best of the search engines at staying current.

HotBot
A joint venture between Inktomi and Hotwired, HotBot is a full-textsearch
engine powered by the Inktomi parallel-processing search engine. HotBot
claims to index over 54 million Web pages and promises to index 100%of the
Web and Usenet news and mailing lists. HotBot offers the ability tosearch
by date, resource (Java, Shockwave, VRML, etc.) and location. It supports
Boolean AND/OR/NOT and phrase searching. In addition, it provides relevance
feedback with each retrieval. Below is the simple search form for HotBot,
you can go to the original site for additional search options.

Infoseek Guide
Infoseek offers two distinct search services: Ultrasmart and Ultraseek.
Both services are powered by Infoseek's Ultra technology and allowusers to
choose the appropriate search mode depending on the level of assistance
they need. Ultrasmart lets users to find sites, topics, news and more.
Ultraseek offers what claims to be the fastest, most comprehensiveand
accurate search engine on the net.

Lycos
Lycos is one of the oldest Web search sites, and one of the best Internet
search engines in both quality of information and relevancy of hits.It is
a comprehensive catalog of Internet with more than 50 million URLs.Lycos
is a unique search site because it has large number of binary filesin its
database, including GIF, JPEG, wav, and MPEG files. It also indexesFTP
archives and Gopher menus, giving its index greater depth than mostother
search engines.

Open Text
Open Text claims to be a large database with 10 billion words and phrases
indexed. It offers an excellent combination of power search optionsand
ease of use. It is arguably the best-designed search site on the Web.Open
Text offers two search options: Simple Search and Power Search.

Web Crawler
Webcrawler builds on the philosophy: keep the database lean and displaya
clean list of results. To compile its database, WebCrawler surveysthe
entire Web, evaluating the popularity of each site and storing onlythe
contents of pages that seem well traveled. It currently covers about
500,000 Web pages. WebCrawler returns the title of the Web page, but
without a summary of description.

" End of quote

The following information is quoted from:

GIS Web Site

"Link: http://www.cs.uga.edu/~burson/gis/search.html#HDR1

Web Search Tools - A Comparison

This document is a comparison of four widely used search engines onthe
World Wide Web. They are:

* OpenText Index
* Alta Vista
* Excite
* Magellan

The analysis is divided into two parts - the objective analysis in which
all four engines are judged by the same set of seven criteria, andthe
subjective analysis in which each search engine is subjected to thesame
sequence of queries and evaluated based on the results.

[Image]

The Objective Analysis

The four search engines will be evaluated by seven criteria:

1. Distribution of Resources
2. Heterogeneity of searched material
3. Use of Metadata
4. Composability of Results
5. Synchronization
6. Efficiency
7. Effectiveness

[Image]

OpenText Index

Distribution: Indexes about 10 billion words of text on same cluster

Heterogeneity: Text, HTML, SGML

Metadata: Keeps track of words searched for most often and which pagesare
returned most often. Indexes every word of every page. Indexer creates
summaries.

Synchronization: add and update over 50,000 per day by robots

Efficiency: crawler brings in URL and every word is indexed so largeamount
of space required. claim that almost all queries take under 2 seconds.

Effectiveness: high recall but moderate precision

[Image]

Alta Vista

Distribution: Composed of 5 different servers - Main, Web Indexer, Scooter,
News Indexer, News Server. Company claims robot software to push "world's
fastest spider", Indexer Software can crunch 1 GB of text per hour.Serves
multiple queries in parallel.

Heterogeneity: Text, HTML, SGML, Newsgroups

Metadata: provides "near" operator, must maintain association of words,
ranks based on frequency, maintains update information for files

Synchronization: Updated constantly by Web robot, Indexer can index1 GB of
text per hour.

Efficiency: Total hard disk of 281 GB - 8 billion words, over 16 million
Web pages. Index over 13,000 Newsgroups updated in real time.

Effectiveness: high recall, low precision

[Image]

Excite

Distribution: Run on 8-CPU SparcCenter 1000. Indexer, index, crawleron
same filesystem

Heterogeneity: Text, HTML, Newsgroups, Reviews(local)

Metadata: Provides concept search. Maintains data on semantic association.
Summaries derived from dominant themes .

Synchronization: Index rebuilt approximately once a week by Spider

Efficiency: Indexes about 1.5 million pages

Effectiveness: high precision, moderate recall

Note: I wrote the people at Excite(Architext Software) to obtain more
information. Here is what i got.

[Image]

Magellan

Distribution: Powered by Silicon graphics workstations and servers

Heterogeneity: Text, HTML

Metadata: Maintains ratings for sites - content descriptive

Synchronization: company claims to have "planned, frequent updates"

Effectiveness:low recall, moderate precision

[Image]

The Subjective Analysis

The subjective analysis is based on a set of queries on three subjects.

1)Information on the new planet found recently:

* new planet (and)
* planet discover (and)
* new planet identif (and)

2)The latest economic forecast for Japan:

* Japan econom (and)
* Japan

3)A recent picture of the space shuttle

* space shuttle (and)
* nasa

The queries were run on each search engine and the results obtainedwere as
follows.

[Image]

OpenText Index

new planet: Found 3000 - very imprecise, no obvious links

planet discover: Found 2000 - very imprecise, no obvious links

new planet identif: Found 1500 - very imprecise no obvious links

Japan econom: Found 74 - most seemed unrelated, no obvious links

Japan: Found 37,000 - able to use "Webservers in Japan" to locate the
Economic Planning agency

space shuttle: Found 14 - most seemed unrelated, no obvious links

nasa: Found 57,000 - use "NASA homepage" to locate some pictures

Comments: The search was conducted using a Simple Search (the PowerSearch
simply allowed chaining with boolean operators). The and operator was
selected for all searches.

The Bottom Line: OpenText returns very large numbers of documents, butwith
very little precision. It would work well for searches of limited
complexity, that is queries that do not require any conceptual
interpretation.

[Image]

Alta Vista

new planet: Found 200,000 - very imprecise, no obvious links

planet discover: Found 100,000 - very imprecise, one related link, wasn't
working

new planet identif: Found 3,000 - very imprecise, no obvious links

Japan econom: Found 700,000 - what can i say?

Japan: Found 500,000 - very imprecise

space shuttle: Found 2000 - very imprecise but probably a usable one
somewhere

nasa: Found 700,000 - NASA was not in the first 20

Comments: The searches were performed using the and operator. The company
claims to place the "best matches first" but doesn't define "best".

The Bottom Line: I found these results to be similar to the OpenText
results. There is a very high recall but the documents are generallynot
relevant. I would suggest a very precise query to get good results.

[Image]

Excite

new planet: Immediate hit! - found two different articles on first page,
new planet1, newplanet2

planet discover: not necessary

new planet identif: not necessary

Japan econom: all links relevant! - found several including Economic
Planning agency

Japan: not necessary

space shuttle: Found 14 - all relevant including> NASA link

nasa: not necessary

Comments: The searches were done using the concept option, rather thanthe
keyword option. The results were very impressive. Only one query wasneeded
for each subject.

The Bottom Line: Excite is the most useful search engine that I havefound.
Using the concept search, it seems to return very precise results and
orders them according to relevance. It should be noted that Excitefound
everything relevant that the other search engines found and more.

[Image]

Magellan

new planet: Found 1000 - nothing relevant in first 30

planet discover: Found 7 - seem unrelated

new planet identif: Found 0

Japan econom: Found 26 - several somewhat related links, none obvious

Japan: Found 261 - including "Webservers in Japan"

space shuttle: Found 0

nasa: Found 232 - including NASA link

Comments: The searches were done using the and operator and the "searchfor
related words" option.

The Bottom Line: There was not a high rate of recall, as there was with
OpenText and Alta Vista and the results were not extremely precise.

[Image]

Conclusion

In conclusion, I've found all of these tools to be powerful and useful
search engines. It seems important to mention that all of these searches
and results could be largely attributed to chance. It therefore seems
likely that all of the tools would yield different results for different
queries, which makes it hard say that one search tool is "better" than
another. However, the results of this analysis seem to suggest thatExcite,
while a relatively new site, provides excellent results with high
precision. The evidence here is definitely in support of the "concept
search" for accurate results. OpenText and Alta Vista return largenumbers
of documents making it very difficult to locate the precise hits ifany are
returned. Magellan simply did not provide a high return or precisionwhen
compared to the others. Excite clearly yielded the best results forthese
queries.

"End of Quote

The following information is quoted from:

Kathryn Paul, Information Services Librarian, University of Victoria.

"Link: http://burns.library.uvic.ca/searchengineanalysis.html

Ongoing Search Engine Analysis

[last updated June 23, 1997.]

Compiled by Kathryn Paul, Information Services Librarian, Universityof
Victoria, British Columbia, Canada.

"End of Quote

The following search tips are quoted from:

"

http://www.stark.k12.oh.us/Docs/search/default.html

Major Internet Search Engines

by Judy Birmingham
Updated Feb. 18, 1997

HotBot
Indexes Web keywords, Usenet News
Classification By Subject

Boolean Search for all the words, any of the words, person's name,
or using Boolean AND, OR, NOT
Wildcards Can search for exact phrase

Notes Fast! Can limit search to media type as indicated by file
extension (e.g., .gif)

Yahoo

Indexes Title, comment words, and subject category keywords of
Internet Sites (not just Web sites)
Classification By Subject
Boolean + (require keyword). - (keyword must not be present).

Wildcards * (on right hand side of word). Exact phrases in double
quotes.

Notes Headline News, Yellow and white pages, Weather, Stock
quotes and Maps also!

InfoSeek
Web Pages, Usenet News, News Wires, Premier News (7 major
Indexes news sources: CNN, NYT, etc.), Email addresses, Company
Profiles, FAQs, Images
Classification By Subject (Infoseek Topics)

Boolean + (guarantees keyword will match)
- (eliminates documents with this keyword)

Wildcards Use double quotes to search for a phrase. Capitalize
proper names.
Notes Large Subject Directory

Alta Vista
Indexes Web documents, Usenet
Classification No subject classification
+ guarantees the keyword will match.
Boolean - eliminates documents with this keyword
Advanced Query: AND, OR, NOT, NEAR.
Weighted keyword searching.

Wildcards * at the end of keyword, Use double quotes to search for a
phrase.
Notes Digital Equipment Corporation

Excite

Indexes Web documents, last two weeks of Usenet, Reviews, Usenet
Classified Ads
Classification By Subject (Net Directory)
May use AND, AND NOT, OR, and parentheses. + (require
Boolean word). - (require word not be present). Repeat word to
emphasize its importance
No wildcards, so use multiple forms of a keyword. Will
Wildcards automatically search for synonyms and related terms. Use
capitals for names.
Notes Also has current news, city guides, yellow pages

Lycos
Indexes Web keywords, subjects, sound files, images
Classification By Subject
Boolean - (word must not appear)

Wildcards Automatically searches for plurals, variant forms of word.
Period to match word exactly. $ (on right side of word).

Notes Offers Reviews of Web pages (Top 5%), News, City Guides,
and Stock Quotes

Magellan
Indexes Submitted and reviewed web pages
Classification By Subject
Boolean AND, OR, NOT, NEAR/n, ADJ, W/n

Wildcards + (keyword must appear)
- (keyword must not appear)
web pages are rated (1-4 stars)
Notes [green light icon] indicates kid safe pages

WebCrawler
Indexes Web page titles and summaries
Classification By Subject

Boolean AND, OR, NEAR (within 25 words of), NOT, ADJ, and
parentheses.

Wildcards Exact word match. Use double quotes for exact phrase
match.
Notes

EINET Galaxy
Indexes Web and telnet resources
Classification By Subject
Boolean AND, OR, NOT
Wildcards * at the end of words

Notes Resources approved for inclusion. Target audience:
Professionals

Open Text Index
Indexes web documents
Classification No subject classification scheme

Boolean Power Search: AND, OR, NOT, NEAR, FOLLOWED BY
Weighted Search allows you to weight keywords
Wildcards No wildcards, so use multiple forms of a keyword
Notes

Reference.Com
Indexes Usenet Newsgroups, Listservs, Mailing Lists, and Webforums
Classification Only by newsgroup hierarchy so far.
Boolean AND, OR , NOT, or NEAR with OR being the default
All keyword searches will default to finding all words
Wildcards with the same word stem. * represents one or more
characters. Use single quotes for exact phrases.

Notes This search engine will locate webforums, which are the
newest form of collaboration on Internet.

L-Soft's Catalyst
Indexes Public listserv lists on Internet
Classification Search by host, country, listname, or list title

Boolean Exact phrase match. Use a comma (,) as an OR between
words.
Wildcards No wildcards
Notes Updated daily. Currently at 12,000+ lists

Liszt: Directory of E-mail Discussion Groups

Indexes List of listservs, listprocs, majordomo and independent
mailing lists plus Newsgroups
Classification By subject

Boolean AND implied between keywords, may also use OR, NOT, and
parentheses ()

Wildcards Keywords treated as substsrings. * is wildcard.
Use quotes (") to match phrases.

Notes Indexes listservs, listproc, and majordomo mailing lists.
Some will be private.

List of Lists

Indexes Mailing lists run with LISTSERV, LISTPROC, MAJORDOMO, and
MAILBASE software
Classification None
Boolean Implied AND between all keywords
Wildcards No wildcards

Notes This is the classic list of mailing lists from early
Internet days.

List of Listservs
Indexes Mailing lists run with LISTSERV software

Classification By List Name, Subject, Keyword, Host Country, Host
Organization, and by Number of Members
Boolean Implied OR between all keywords
Wildcards No wildcards
Notes Does not include LISTPROC and MAJORDOMO mailing lists

Publicly Available Mailing Lists
Indexes Mailing lists (email discussion groups)
Classification Alphabetical, by Name and Subject Keyword
Boolean None
Wildcards None
This is an alphabetical list of mailing lists (discussion
Notes groups). These are public, but they are not necessarily
automated like listservs.

DejaNews
Indexes USENET news postings
Classification None
| (or)
& (and)
Boolean &! (and not)
( ) (for nesting)
{car cat} Alphabetical range of words between car and cat
Allows author, subject, date filters
Wildcards * as wildcard

Notes Defaults to the last several weeks of articles. Can select
last two years worth.

USENET FAQS
Indexes FAQs which are posted to news.answers
Classification By Subject, Newsgroup Name

Boolean You may search for newsgroup name, archive name, or
keyword
Wildcards Exact word matching
Notes Maintained at Ohio State

Whowhere?
Indexes Personal or Organizational Names and email addresses
Classification No classification scheme
Boolean Supply personal name and if possible, his organization
Wildcards Fuzzy Search will approximate name spellings
Notes None

Four11
Indexes Personal name, domain, city, state, country
Classification No classification scheme
Boolean All fields are optional

Wildcards Smart Search will approximate name spellings, equate "Bob"
with "Robert" etc.
Notes Updated daily; addresses verified yearly.

Finger Gateway
Indexes Personal name, username, computer host names
Classification No classification scheme
Boolean Simply enter username and host.domain in the blanks
Wildcards Can supply just a portion of the person's name

Notes Returns the user's identity, email address, and often
other information.

Netfind
Indexes Locates people when you don't know the host computer
Classification List of Netfind Servers

Boolean Must enter user's last name plus a few keywords indicating
geographical or organizational location

Wildcards Login as "netfind" at the prompt, and follow onscreen
instructions
Notes Select "Seed Search" to search for an organization

Metasearch Sites
Metacrawler
Search.Com
Savvy Search
All In One Search
Yahoo's List of Search Engines



Reference Collection

Ohio Colleges
A list of 130 universities and colleges in Ohio.
General Reference (Yahoo)
Lists of dictionaries, encyclopedias, fact books, etc. on Internet.
General Reference (EINET)
Lists of dictionaries, encyclopedias, fact books, census data, etc.on
Internet
Internet Public Library
A virtual library: reference works, young adult links, services for
librarian, interactive internet instruction and online courses, etc.
Carrie: Full Text Electronic Library
General Reference, Electronic books, Electronic Newspapers and
Magazines, Government Documents, etc.
USENET FAQ Archive
Archive of current USENET newsgroup Frequently Asked Question files.
Libraries on Internet
Index to academic, school, and public libraries and library resources
on the Net.
Organization for Community Networks and Freenets
Community based systems providing local information resources, limited
access to Internet, and k12 programs such as The Academy Program.
Electronic Texts
Texts and Hypertexts of thousands of classics available on the
Internet.
Sites for Educators
A short list of interesting sites of interest to K-12 teachers, such
as ERIC and Classroom Connect.



" End of Quote

The following information is quoted from:

"Link: http://www.onlineinc.com/onlinemag/MayOL/zorn5.html

Advanced Searching: Tricks of the Trade

by
Peggy Zorn, Mary Emanoil, and Lucy Marshall
Parke-Davis Pharmaceutical Research Library

and

Mary Panek
United Technologies Research Center

ONLINE, May 1996
Copyright Online Inc.

"End of Quote
 
 

Very informative links:

Press Release from:

http://www.positionagent.com/prparsch.htm

http://www.state.wi.us/agencies/dpi/www/search.html

http://www.altavista.digital.com/

http://www.altavista.magallanes.net

http://altavista.skali.com.my

Excite
http://www.excite.com/

Excite Power Search
http://www.excite.com/search/options.html?a-opt-t

Excite offers custom search tools
http://www.news.com/News/Item/0,4,11353,00.html

Infoseek Industry Watch
http://industrywatch.infoseek.com

LookSmart
http://www.looksmart.com/

LookSmart Site Submission Form
http://www.looksmart.com/h/info/submsite.html

WebCrawler
http://www.webcrawler.com/

WebCrawler Add URL Help Page
http://www.webcrawler.com/WebCrawler/
Help/GetListed/HelpAddURL.html

WebCrawler Add URL Page
http://www.webcrawler.com/WebCrawler/
Help/GetListed/AddURLS.html
 
 

* Differences between Search Engines and Directories.

[Image] http://www.cnet.com/Content/Features/Dlife
/Search/ss02.html
 
 

* Web Page Ranking.

Here are some good links that will give you wealth of information onWeb
Page Ranking and other beneficial topics.





http://www.iglou.com/sgrant/hit/
 
 

* How to EXCLUDE or REMOVE your pages from being listed by using:
"noindex" or "robots.txt"?

Webmasters are always faced with the need to exclude certain pages ora
whole site from being automatically indexed by search engines for many
reasons. If you have a page/s that you need to exclude from searchengines,
such as: pages that are restricted to a certain audience, or maybepages
that can only be viewed with subscription, etc. You will need to consider
the following options:

To exclude a page from being indexed with search engines that utilizeMeta
tags, you can use a tag such as:

<META NAME="ROBOTS" CONTENT="NOINDEX"> in the <Head> of your HTMLpage that
you want to exclude.

For example:

<html>
<head>
<meta name="robots" content="noindex">
<meta name="description" content="Your Page Description ....">
<titleYour Page Title ...</title>
</head>
<body>

To exclude a page and all the links that are in it, then you need touse
the following tag

<META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW">

For example:

<html>
<head>
<meta name="robots" content="noindex,nofollow">
<meta name="description" content="Your Page Description ....">
<title>Your Page Title ...</title>
</head>
<body>

Additional info.:

http://info.webcrawler.com/mak/projects/
robots/meta-notes.html

As for search engines that do not support tags, then you need to use
"robots.txt"

Here's some example links:

http://info.webcrawler.com/mak/projects/
robots/exclusion-admin.html

http://www.w3.org/pub/WWW/Search/9605-Indexing-
Workshop/Papers/Frumkin@Excite.html
#Complex_directory_structures

http://info.infoseek.com/robots.txt

http://info.webcrawler.com/mak/projects/
robots/robots.html

* Search Engines Chart

Please follow this link for details:

[Image]

http://searchenginewatch.com/features.htm
 
 

7. Spamming (the forbidden word)

* Spamming and Penalties "According to INFOSEEK"

Quoted as is from Infoseek forum:

http://www.infoseek.com/Help?sv=IS&
lk=noframes&pg=meta_tag.html#RESTRICTIONS

What is Spamming?

Please note that some Web authors use techniques to artificially inflate
the relevance scores for their sites. This practice is often referredto as
spamming. Infoseek's index detects common spamming practices and penalizes
pages that use them. Unfortunately, in some cases this may cause valid
pages to be penalized as well.

For best results, the following Web publishing techniques should be
avoided:

Overuse or repetition of keywords. Use of meta refresh faster than the
human eye can see Use of colored text on same-color background Useof
keywords that do not relate to the content of the site Duplicationof
pages with different URLs

Repetitive submission of spammed pages may result in a permanent exclusion
from Infoseek's index.
 
 

8. Final Test:

Now its time to put your page together using what you've learned

Here is a sample page to guide you through the process: Make sure to
replace the words within the Bold lettered TAGS, your own Title,
Description and Keywords that you put together according to my instructions
earlier on this page. Also, make sure that the tags are used exactlyas
shown; otherwise, the text becomes visible inside your page. Alwayscheck
your pages using a browser before publishing to verify their integrity.

The following procedure needs to be done inside your HTML page by clicking
on View in your FrontPage Editor or similar editors, then go to HTMLand do
your changes there.

Use of Title, Description and Keywords within the head of your HTMLpage.

<head>

" Title: Do not ecceed 72 characters in your title including spaces."

<title>Search Engines Secrets and Website Promotions - Guaranteed
Hits</title>

"Description: Do not exceed 200 characters in your description including
spaces."

<meta name="description"content="Discover The Most Powerful and Proven
strategies used to give your website a Top 10 - 20 Search Engine Listing.
The Best FREE Website Promotion Tool for guaranteed hits...">

Keywords: Do not repeat same keywords more than 7 times

<meta name="keywords"
content="website, promotions,promotion,search, engines, results,
advertising, businesses, advertising, websites, marketing,promotions,
promoting, searches, engines, listings, opportunities, search, listings,
search, engine,web site, traffic, homepages, hits, generating, website,
traffic, yellow, pages, classifieds, advertising, promotional, strategies
">
</head>

Additional Keywords sample that can be used
inside the body of your HTML page:

<body>
<!--Website Promotion. website, promotions,promotion,search, engines,
results, advertising, businesses, advertising, website,
marketing,promotions, promoting, searches, engines, listings,
opportunities, search, listings, search, engine,website, traffic,
homepages, hits, generating, website, traffic, yellow, pages, classifieds,
advertising, promotional, strategies -->

<!--Robots and Spiders - Search Engine Secrets, Listings and Positioning.
Website Promotion. Guaranteed Results! Discover Powerful And Proven
strategies that Really Work To get you a TOP Search Engine Listing.
Free Advertising, Free advertising Free reports, Free Web Site Promotion,
Free Web Site Submission, Free Web Site Marketing, Free Web Site Placement,
Free Web Site Services, Free Advertising, Free Internet Advertising,Free
Website Placement, Free Internet Publishing, Free information, Freeon-line
business Advertising -->

<body>
 
 

5. Listing Your Pages

* Submitting your pages to Search Engines and Directories.

Now the final moment has come and you're ready to list your pages with
Search Engines.

Please click on the following link to move to the listing page whereyou
will find all the resources needed to list and test your URLs. Thankyou
for your patience, and GOOD LUCK!

Submitting your pages to Search Engines and Directories.

Disclaimer

The information provided in this report is on an "as is" basis. ECS
Computer Technologies, Inc. makes no guarantee whatsover as to the
performance or the results.
Most of the techniques used in this report are substantially currentand
adequate to achieve good results with search engines if used properly.
However, some of this information may change due to the ever-changing
nature of the Internet, search engines, and other factors.
In order to achieve optimum results, customers are strongly encouragedto
read and understand this information. Results may take 30 days or moreto
become apparent.
 

This document is Copyright 1997 by Imad Hamdan, ECS Computer
Technologies, Inc.
All referring documents on this page and their copyrights are the sole
property of their original owners.
 











Copyright © 1995-2003 Sidetrips™ Internet Services, Inc. All rights reserved.

Reproduction in whole or in part in any form or medium without express
written permission of Sidetrips Internet Services, Inc. is prohibited.

sidetrips.com™, hostserver.com™, registera.com™, htmlshop.com™,
and associated domains are trademarks of Sidetrips™ Internet Service, Inc.

Privacy Policy | Acceptable Use Policy | Terms of Service