Title: your site's title.
Author: your name or company name.
Copyright: your copyright notice.
Keywords: default keyword list for when extraction is being skipped. You'll probably want to add these to your wanted list below too.
Description: description of the site as it should appear on search engines.
Rating: is used to give the web page a rating for the appropriateness for kids. The ratings are, general, mature, restricted, and 14 years.
P3P Compact Policy: this allows you to add your compact P3P policy to the header information. More info on P3P can be found P3P Compact Policy.
Leave blank if unsure!
Robots: Whether robots should index the page and stop or not index but follow links, etc. index/follow is best.
Pragma: Turned on, it tells browser & proxies not to cache your page. Has no effect on search engines.
Bookmark Icon: IE uses this when somone bookmarks the page to display a cute icon in the browser bar. Will probably be ../../favicon.ico on any xoops machine.
Enter any other meta data or javascript you want to appear between your <head> </head>.
DO NOT enter any normal html here, this is only for <meta>, <script>, <style>, etc.
For example, you may want to add your ICRA rating tag in here. ICRA online generator.
Simply replaces the default meta/footer textbox. Use this instead.
Wanted: these words are added to the keywords just before shuffling. Like this you can have certain keywords repeated frequently across the entire site. Can also contain 2+ word keywords, and words smaller than 4 letters.
Unwanted: all these words will be removed from the extracted words as to keep the keywords as relevant to the page in question as possible. No use adding words smaller than 4 letters big here, since they are automaticly stripped anyways.
Note: in order to add a word, simply fill in the blank box and hit the right/left arrow, to move words between lists, just select the words and use the arrows, and to remove, just select words and hit remove.
Extractor: this simply turns dynamic keyword extraction on or off.
Cloaking: if turned on (recommended), then keywords are not extracted if a normal visitor is detected, thus saving horsepower.
Max. Depth: how deep you want the extractor to crawl into links. For example in downloads, it could grab all text from xx different downloads present on the page, and compile the keyword list from that.
Max. Words: limits the maximum number of keywords to generate.
User Agents :these are normal browser codes. If HTTP_USER_AGENT does not matches any of these...then we skipextraction since it's not a search engine viewing the page. Cloaking must be turned ON to make usage of this feature.
Browse to any plain text file on your computer containing keywords delimited by spaces, newlines, commas, pipes..and import them into one of the above lists.
The below describes in a brief manner what the keyword extractor does.
If a search engine is detected:
We uppercase because a search for:
"xoops" will find "xoops, Xoops, XOOPS"
"Xoops" will find "Xoops" and eventually "XOOPS"
So if a user searches in lowercase he'll still find your word, but if he uppercases his 1st letter you loose... Like this we catch both users that may uppercase the 1st letter of a word ANDthose that search in lowercase.
If a normal user is detected (no search engine) & cloaking is turned ON, then we skip all the above and display normal keywords, thus saving cpu/mysql consumption.
Extra Links:
http://searchengineforums.com
http://searchenginewatch.internet.com
Not a complete list, but should help you understand whats going on here :o)
© 2006 The E-Xoopport Project - This manual is under the terms of the GNU/FDL License