concepts.org
robots.txt

Robots Exclusion Standard data for concepts.org

Resource Scan

Scan Details

Site Domain concepts.org
Base Domain concepts.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2025-06-19T13:38:39+00:00
Next Scan 2025-09-17T13:38:39+00:00

Last Successful Scan

Scanned2023-03-09T07:57:38+00:00
URL https://www.concepts.org/robots.txt
Domain IPs 104.21.34.37, 172.67.167.220, 2606:4700:3031::ac43:a7dc, 2606:4700:3034::6815:2225
Response IP 104.21.34.37
Found Yes
Hash 14a6661a43c12275654e39eb701760a1cf4a314945a4045193feb26e656e1c9e
SimHash 625e4ec1c981

Groups

*

Rule Path
Disallow /index.php/Help
Disallow /index.php/MediaWiki
Disallow /index.php/Special%3A
Disallow /index.php/Template
Disallow /skins/

Other Records

Field Value
crawl-delay 30

Other Records

Field Value
sitemap http://www.concepts.org/sitemap-index-concepts.xml

Comments

  • This one turns off legit robots from crawling
  • Disallow: /index.php?
  • Florent Created:
  • Disallow: /index.php/
  • Created by Mediawiki
  • Created by Florent from robots.txt web site helps

Warnings

  • `host` is not a known field.