community.simon42.com
robots.txt

Robots Exclusion Standard data for community.simon42.com

Resource Scan

Scan Details

Site Domain community.simon42.com
Base Domain simon42.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Request timed out.
Last Scan 2025-12-20T20:16:46+00:00
Next Scan 2026-03-20T20:16:46+00:00

Last Successful Scan

Scanned 2024-11-03T07:55:56+00:00
URL https://community.simon42.com/robots.txt
Domain IPs 78.47.205.13
Response IP 78.47.205.13
Found Yes
Hash ecce334404e4efe325becde6895fdbb2f360929f2d160d62d42a80acab2da6b9
SimHash 289d1d8577d0
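
The timestamps in the two scan sections can be compared directly. The sketch below uses only the three values listed above and computes the gap between the failed scan and the next scheduled one, and how old the last successful result was when the failed scan ran. Reading the first gap as a fixed rescan interval is an assumption; the report does not state a schedule.

```python
from datetime import datetime

# Values copied from the Scan Details and Last Successful Scan sections.
last_scan = datetime.fromisoformat("2025-12-20T20:16:46+00:00")
next_scan = datetime.fromisoformat("2026-03-20T20:16:46+00:00")
last_success = datetime.fromisoformat("2024-11-03T07:55:56+00:00")

# Gap between the failed scan and the next scheduled attempt.
rescan_gap = next_scan - last_scan

# Age of the last successful result at the time of the failed scan.
data_age = last_scan - last_success

print("days until next scan:", rescan_gap.days)
print("age of robots.txt data in days:", data_age.days)
```

Because the latest scan timed out, every rule listed under Groups reflects the file as fetched on 2024-11-03, not its current state.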

Groups

mauibot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

seo spider

Rule Path
Disallow /
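
The five groups above each block an entire crawler with a single `Disallow /` rule. A minimal sketch of how such groups behave, using Python's standard `urllib.robotparser`: the `ROBOTS_TXT` string is a reconstruction from this scan table (the original file's casing, ordering, and grouping are assumptions), and `topic_url` is a hypothetical URL chosen for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan table, not copied from the live file.
ROBOTS_TXT = """\
User-agent: mauibot
Disallow: /

User-agent: semrushbot
Disallow: /

User-agent: ahrefsbot
Disallow: /

User-agent: blexbot
Disallow: /

User-agent: seo spider
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Hypothetical topic URL, used only to exercise the rules.
topic_url = "https://community.simon42.com/t/example-topic/1"

for agent in ("MauiBot", "SemrushBot", "AhrefsBot", "BLEXBot", "UnlistedBot"):
    allowed = parser.can_fetch(agent, topic_url)
    print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

In this excerpt there is no `*` group, so an unlisted agent is allowed; in the full file the `*` group further down applies to it instead.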

*

Rule Path
Disallow /admin/
Disallow /auth/
Disallow /assets/browser-update*.js
Disallow /email/
Disallow /session
Disallow /user-api-key
Disallow /*?api_key*
Disallow /*?*api_key*
Disallow /badges
Disallow /u/
Disallow /my
Disallow /search
Disallow /tag/*/l
Disallow /g
Disallow /t/*/*.rss
Disallow /c/*.rss
Disallow /docs/
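
Several rules in the `*` group use the `*` wildcard, and all of them are matched as prefixes of the URL path. The sketch below illustrates that matching under the RFC 9309 conventions (`*` matches any run of characters, a trailing `$` anchors the end). The helper names `rule_to_regex` and `is_disallowed` and the sample paths are hypothetical. Since the group contains only Disallow rules, any match is enough and no Allow/Disallow precedence logic is needed.

```python
import re

# Disallow paths of the "*" group, as listed in the scan table.
STAR_GROUP_DISALLOW = [
    "/admin/", "/auth/", "/assets/browser-update*.js", "/email/",
    "/session", "/user-api-key", "/*?api_key*", "/*?*api_key*",
    "/badges", "/u/", "/my", "/search", "/tag/*/l", "/g",
    "/t/*/*.rss", "/c/*.rss", "/docs/",
]

def rule_to_regex(rule_path: str) -> re.Pattern:
    """Compile a rule path: '*' becomes '.*', a trailing '$' anchors the end."""
    anchored = rule_path.endswith("$")
    body = rule_path[:-1] if anchored else rule_path
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))

def is_disallowed(path: str, rules: list[str] = STAR_GROUP_DISALLOW) -> bool:
    """True if any rule matches the path from its first character (prefix match)."""
    return any(rule_to_regex(rule).match(path) for rule in rules)

# Hypothetical sample paths.
for path in ("/t/example-topic/1", "/t/example-topic/1.rss",
             "/latest?page=2&api_key=abc", "/u/alice", "/guidelines"):
    print(path, "->", "disallowed" if is_disallowed(path) else "allowed")
```

Note that short rules such as `/g` and `/my` are plain prefixes, so under this reading they also cover any other path that begins with those characters, for example `/guidelines`.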

googlebot

Rule Path
Disallow /admin/
Disallow /auth/
Disallow /assets/browser-update*.js
Disallow /email/
Disallow /session
Disallow /user-api-key
Disallow /*?api_key*
Disallow /*?*api_key*
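
The `googlebot` group repeats only the first eight rules of the `*` group. Under RFC 9309 a crawler obeys the group that names it and falls back to `*` only when no such group exists, so the extra `*` rules (`/u/`, `/search`, and so on) do not apply to Googlebot. A minimal sketch with `urllib.robotparser`: `ROBOTS_TXT` is a partial reconstruction limited to a few plain-prefix rules (wildcard rules are left out to stay within simple prefix matching), and the URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction from the scan table; prefix rules only.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/
Disallow: /u/
Disallow: /search

User-agent: googlebot
Disallow: /admin/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://community.simon42.com"

# Hypothetical URLs used to compare the two groups.
for agent in ("Googlebot", "UnlistedBot"):
    for path in ("/admin/users", "/u/alice", "/search"):
        allowed = parser.can_fetch(agent, BASE + path)
        print(f"{agent} {path}: {'allowed' if allowed else 'blocked'}")
```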

Other Records

Field Value
sitemap https://community.simon42.com/sitemap.xml
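
The sitemap record is independent of the user-agent groups and can be read separately. A short sketch, assuming Python 3.8 or later for `RobotFileParser.site_maps()`; the one-line `ROBOTS_TXT` string is reconstructed from the record above, not copied from the live file.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "Other Records" table.
ROBOTS_TXT = "Sitemap: https://community.simon42.com/sitemap.xml\n"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns a list of sitemap URLs, or None if there are none.
sitemaps = parser.site_maps()
print(sitemaps)
```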

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file