seo glossary, seo terms

AJAX - Asynchronous JavaScript and XML is a technique which allows a web page to request additional data from a server

without requiring a new page to load.

Absolute link - A link which shows the full URL of the page being linked at.

Example absolute link

Cool Stuff

Example relative link

Cool Stuff

Alexa - Amazon.com owned search service

AllTheWeb - Search engine which was created by Fast, then bought by Overture, which was bought by Yahoo.

AltaVista - Search engine bought out by Overture prior to Overture being bought by Yahoo

AOL - Popular web portal which merged with Time Warner.

Ask - Ask is a search engine owned by InterActive Corp. They were originally named Ask Jeeves, but they dumped Jeeves in

early 2006. Their search engine is powered by the Teoma search technology, which is largely reliant upon Kleinberg's concept

of hubs and authorities

Behavioral Targeting - Ad targeting based on past recent experience and/or implied intent. For example, if I recently

searched for mortgages then am later reading a book review the page may still show me mortgage ads.


Block Level Analysis - A method used to break a page down into multiple points on the web graph by breaking its pages down

into smaller blocks for further link analysis.

Canonical URL - The canonical version of any URL is the single most authoritative version indexed by major search engines.

Examples of URLs which may contain the same information in spite of being at different web addresses:

* http://www.seobook.com/
* http://www.seobook.com/index.shtml
* http://seobook.com/
* http://seobook.com/index.shtml
* http://www.seobook.com/?tracking-code


CGI - Common Gateway Interface - interface software between a web server and other machines or software running on that

server. Many cgi programs are used to add interactivity to a web site.


Cloaking / IP delivery - Displaying different content to search engines and searchers

Clustering - Clustering can also refer to a technique which allows search engines to group hubs and authorities on a specific

topic together to further enhance their value by showing their relationships.


Co-citation - In topical authority based search algorithms links which appear near one another on a page may be deemed to be

related to one another. In algorithms like latent semantic indexing words which appear near one another often are frequently

deemed to be related.

Copyright - The legal rights to publish and reproduce a particular piece of work.

Cookie - Small data file written to a user's local machine to track them. Cookies are used to help websites customize your

user experience and help affiliate program managers track conversions.

CSS - Cascading Style Sheets is a method for adding styles to web documents.

Note: Using external CSS files makes it easy to change the design of many pages by editing a single file. You can link to an

external CSS file using code similar to the following in the head of your HTML documents




Cybersquatting - Registering domains related to other trademarks or brands in an attempt to cash in on the value created by

said trademark or brand.


Dayparting - Turning ad campaigns on or off, changing ad bid price, or budget constraints based on bidding more when your

target audience is available and less when they are less likely to be available.

DNS - Domain Name Server or Domain Name System. A naming scheme mechanism used to help resolve a domain name / host name to a

specific TCP/IP Address.


Everflux - Major search indexes are constantly updating. Google refers to this continuous refresh as everflux.

In the past Google updated their index roughly once a month. Those updates were named Google Dances, but since Google shifted

to a constantly updating index Google no longer does what was traditionally called a Google Dance.


FFA - Free for all pages are pages which allow anyone to add a link to them


Frames - A technique created by Netscape used to display multiple smaller pages on a single display. This web design

technique allows for consistent site navigation, but makes it hard to deep

Fuzzy Search - Search which will find matching terms when terms are misspelled (or fuzzy).

GAP - Google Advertising Professional is a program which qualifies marketers as being proficient AdWords marketers.

Google Bombing - Ranking page well for a specific search query by pointing hundreds or thousands of links at it with the

keywords in the anchor text.


Google Bowling - Knocking a competitor out of the search results by pointing hundreds or thousands of low trust low quality

links at their website. Typically it is easier to bowl new sites out of the results. Older established sites are much harder

to knock out of the search results.

Google Dance - In the past Google updated their index roughly once a month. Those updates were named Google Dances, but since

Google shifted to a constantly updating index, Google no longer does what was traditionally called a Google Dance. Major

search indexes are constantly updating. Google refers to this continuous refresh as everflux.


Google OneBox - Portion of the search results page above the organic search results which Google sometimes uses to display

vertical search results from Google News, Google Base, and other Google owned vertical search services.

Google Sitelinks - On some search results where Google thinks one result is far more relevant than other results (like

navigational or brand related searches) they may list numerous deep links to that site at the top of the search results.

Google Supplemental Index - Index where pages with lower trust scores are stored. Pages may be placed in Google's

Supplemental Index if they consist largely of duplicate content, if the URLs are excessively complex in nature, or the site

which hosts them lacks significant trust.

Hilltop - Algorithm which ranks results largely based on unaffiliated expert citations.

HITS - Link based algorithm which ranks relevancy scores based on citations from topical authorities.

Hijacking - Making a search engine believe that another website exists at your URL. Typically done using techniques such as a

302 redirect or meta refresh.

.htaccess - Apache directory-level configuration file which can be used to password protect or redirect files.

IDF - Inverse Document Frequency is a term used to help determine the position of a term in a vector space model.

IDF = log ( total documents in database / documents containing the term )

Inktomi - Search engine which pioneered the paid inclusion business model. Inktomi was bought by Yahoo! at the end of 2002.

JavaScript - A client-side scripting language that can be embedded into HTML documents to add dynamic features.

Link Baiting - The art of targeting, creating, and formatting information that provokes the target audience to point high

quality links at your site.

Link Bursts - A rapid increase in the quantity of links pointing at a website.

Link Churn - The rate at which a site loses links.

Link Equity - A measure of how strong a site is based on its inbound link popularity and the authority of the sites providing

those links.

Link Hoarding - A method of trying to keep all your link popularity by not linking out to other sites, or linking out using

JavaScript or through cheesy redirects.

Link Popularity - The number of links pointing at a website.

Link Rot - A measure of how many and what percent of a website's links are broken.

Looksmart - Company originally launched as a directory service which later morphed into a paid search provider and vertical

content play.

LSI - Latent Semantic Indexing is a way for search systems to mathematically understanding and representing language based on

the similarity of pages and keyword co-occurance. A relevant result may not even have the search term in it. It may be

returned based solely on the fact that it contains many similar words to those appearing in relevant pages which contain the

search words.

Mechanical Turk - Amazon.com program which allows you to hire humans to perform easy tasks that computers are bad at.

Meta Refresh - A meta tag used to make a browser refresh to another URL location. A meta refresh looks like this



Meta Search - A search engine which pulls top ranked results from multiple other search engines and rearranges them into a

new result set.

Multi Dimensional Scaling - The process of taking shapshots of documents in a database to discover topical clusters through

the use of latent semantic indexing. Multi dimensional scaling is more efficient than singular vector decomposition since

only a rough approximation of relevance is necessary when combined with other ranking criteria.

Natural Language Processing - Algorithms which attempt to understand the true intent of a search query rather than just

matching results to keywords.

Poison Word - Words which were traditionally associated with low quality content that caused search engines to want to demote

the rankings of a page.

Portal - Web site offering common consumer services such as news, email, other content, and search.

Precision - The ability of a search engine to list results that satisfy the query, usually measured in percentage. (if 20 of

the 50 results match the query the precision is 40%) Search spam and the complexity of language challenge the precision of

search engines.

Proximity - A measure of how close words are to one another.

Query Refinement -

* Google OneBox: promotes a vertical search database near the top of the search result. For example, if image search is

relevant to your search query images may be placed near the top of the search results.
* Spell Correction: offers a did you mean link with the correct spelling near the top of the results.
* Inline Suggest: offers related search results in the search results. Some engines also suggest a variety of related

search queries.

Registrar - A company which allows you to register domain names.

Reputation Management - Ensuring your brand related keywords display results which reinforce your brand. Many hate sites tend

to rank highly for brand related queries.

Reverse Index - An index of keywords which stores records of matching documents that contain those keywords.

Scumware - Intrusive software and programs which usually target ads, violate privacy, and are often installed without the

computer owner knowing what the software does.

Search History - Many search engines store user search history information. This data can be used for better ad targeting or

to make old information more findable.

Search engines may also determine what a document is about and how much they trust a domain based on aggregate usage data.

Many brand related search queries is a strong signal of quality.

SEO Copywriting - Writing and formatting copy in a way that will help make the documents appear relevant to a wide array of

relevant search queries.

Search Marketing - Marketing a website in search engines. Typically via SEO, buying pay per click ads, and paid inclusion.

Siphoning - Techniques used to steal another web sites traffic, including the use of spyware or cybersquatting.

Splash Page - Feature rich or elegantly designed beautiful web page which typically offers poor usability and does not offer

search engines much content to index.

Spyware - Software programs which spy on web users, often used to collect consumer research and to behaviorally targeted ads.

SSI - Server Side Includes are a way to call portions of a page in from another page. SSI makes it easier to update websites.

Stemming - Using the stem of a word to help satisfy search relevancy requirements. EX: searching for swimming can return

results which contain swim.

Teoma - Topical community based search engine largely reliant upon Kleinberg's concept of hubs and authorities. Teoma powers

Ask.com.

Trackback - Automated notification that another website mentioned your site which is baked into most popular blogging

software programs.

Due to the automated nature of trackbacks they are typically quite easy to spam. Many publishers turn trackbacks off due to a

low signal to noise ratio.

TrustRank - Search relevancy algorithm which places additional weighting on links from trusted seed websites that are

controlled by major corporations, educational institutions, or governmental institutions.

Usenet - A search service which is focused on a particular field, a particular type of information, or a particular

information format.

Vertical Search - A search service which is focused on a particular field, a particular type of information, or a particular

information format.

Viral Marketing - Self propagating marketing techniques. Common modes of transmission are email, blogging, and word of mouth

marketing channels.

Wordnet - A lexical database of English words which can be used to help search engines understand word relationships.

XHTML - Extensible HyperText Markup Language is a class of specifications designed to move HTML to conform to XML formatting.

XML - Extensible Markup Language is a simple, very flexible text format derived from SGML, used to make it easy to syndicate

or format information using technologies such as RSS.