concepts.org
robots.txt

Robots Exclusion Standard data for concepts.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	concepts.org
Base Domain	concepts.org
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a server error.
Last Scan	2025-06-19T13:38:39+00:00
Next Scan	2025-09-17T13:38:39+00:00

Last Successful Scan

Scanned	2023-03-09T07:57:38+00:00
URL	https://www.concepts.org/robots.txt
Domain IPs	104.21.34.37, 172.67.167.220, 2606:4700:3031::ac43:a7dc, 2606:4700:3034::6815:2225
Response IP	104.21.34.37
Found	Yes
Hash	14a6661a43c12275654e39eb701760a1cf4a314945a4045193feb26e656e1c9e
SimHash	625e4ec1c981

Groups

*

Rule	Path
Disallow	/index.php/Help
Disallow	/index.php/MediaWiki
Disallow	/index.php/Special%3A
Disallow	/index.php/Template
Disallow	/skins/

Rule

Path

Disallow

/index.php/Help

Disallow

/index.php/MediaWiki

Disallow

/index.php/Special%3A

Disallow

/index.php/Template

Disallow

/skins/

Other Records

Field	Value
crawl-delay	30

Field

Value

crawl-delay

30

Back to top

Other Records

Field	Value
sitemap	http://www.concepts.org/sitemap-index-concepts.xml

Field

Value

sitemap

http://www.concepts.org/sitemap-index-concepts.xml

Back to top

Comments

This one turns off legit robots from crawling
Disallow: /index.php?
Florent Created:
Disallow: /index.php/
Created by Mediawiki
Created by Florent from robots.txt web site helps

Back to top

Warnings

`host` is not a known field.

Back to top

concepts.orgrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

Other Records

Other Records

Comments

Warnings

concepts.org
robots.txt