omri.org
robots.txt

Robots Exclusion Standard data for omri.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	omri.org
Base Domain	omri.org
Scan Status	Ok
Last Scan	2025-05-30T10:19:03+00:00
Next Scan	2025-06-29T10:19:03+00:00

Last Scan

Scanned	2025-05-30T10:19:03+00:00
URL	https://omri.org/robots.txt
Redirect	https://www.omri.org/robots.txt
Redirect Domain	www.omri.org
Redirect Base	omri.org
Domain IPs	20.125.23.220
Redirect IPs	20.125.23.220
Response IP	20.125.23.220
Found	Yes
Hash	1b3670a9c1bf0f7873d20de61e2a65f28e022a159e6d0fc32b5c142ba0708ad6
SimHash	38949d08c774

Groups

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

yandex

Rule	Path
Disallow	/

Rule

Path

Disallow

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

petalbot

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

screaming frog seo spider

Rule	Path
Disallow	/

Rule

Path

Disallow

robots-txt-checker

Rule	Path
Disallow	/

Rule

Path

Disallow

duckduckbot-https

Rule	Path
Disallow	/

Rule

Path

Disallow

orbbot

Rule	Path
Disallow	/

Rule

Path

Disallow

seznambot

Rule	Path
Disallow	/

Rule

Path

Disallow

coccocbot-web

Rule	Path
Disallow	/

Rule

Path

Disallow

magpie-crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

*

Rule	Path
Disallow	/includes/
Disallow	/misc/
Disallow	/modules/
Disallow	/profiles/
Disallow	/scripts/
Disallow	/themes/
Disallow	/CHANGELOG.txt
Disallow	/cron.php
Disallow	/INSTALL.mysql.txt
Disallow	/INSTALL.pgsql.txt
Disallow	/install.php
Disallow	/INSTALL.txt
Disallow	/LICENSE.txt
Disallow	/MAINTAINERS.txt
Disallow	/update.php
Disallow	/UPGRADE.txt
Disallow	/xmlrpc.php
Disallow	/admin/
Disallow	/comment/reply/
Disallow	/filter/tips/
Disallow	/logout/
Disallow	/node/add/
Disallow	/search/
Disallow	/user/register/
Disallow	/user/password/
Disallow	/user/login/
Disallow	/?q=admin%2F
Disallow	/?q=comment%2Freply%2F
Disallow	/?q=filter%2Ftips%2F
Disallow	/?q=logout%2F
Disallow	/?q=node%2Fadd%2F
Disallow	/?q=search%2F
Disallow	/?q=user%2Fpassword%2F
Disallow	/?q=user%2Fregister%2F
Disallow	/?q=user%2Flogin%2F

Rule

Path

Disallow

/includes/

Disallow

/misc/

Disallow

/modules/

Disallow

/profiles/

Disallow

/scripts/

Disallow

/themes/

Disallow

/CHANGELOG.txt

Disallow

/cron.php

Disallow

/INSTALL.mysql.txt

Disallow

/INSTALL.pgsql.txt

Disallow

/install.php

Disallow

/INSTALL.txt

Disallow

/LICENSE.txt

Disallow

/MAINTAINERS.txt

Disallow

/update.php

Disallow

/UPGRADE.txt

Disallow

/xmlrpc.php

Disallow

/admin/

Disallow

/comment/reply/

Disallow

/filter/tips/

Disallow

/logout/

Disallow

/node/add/

Disallow

/search/

Disallow

/user/register/

Disallow

/user/password/

Disallow

/user/login/

Disallow

/?q=admin%2F

Disallow

/?q=comment%2Freply%2F

Disallow

/?q=filter%2Ftips%2F

Disallow

/?q=logout%2F

Disallow

/?q=node%2Fadd%2F

Disallow

/?q=search%2F

Disallow

/?q=user%2Fpassword%2F

Disallow

/?q=user%2Fregister%2F

Disallow

/?q=user%2Flogin%2F

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html
For syntax checking, see:
http://www.frobee.com/robots-txt-check
Directories
Files
Paths (clean URLs)
Paths (no clean URLs)

omri.orgrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

gptbot

yandex

ahrefsbot

petalbot

mj12bot

semrushbot

screaming frog seo spider

robots-txt-checker

duckduckbot-https

orbbot

seznambot

coccocbot-web

magpie-crawler

*

Other Records

Comments

omri.org
robots.txt