forum.linuxconfig.org
robots.txt

Robots Exclusion Standard data for forum.linuxconfig.org

Resource Scan

Scan Details

Site Domain forum.linuxconfig.org
Base Domain linuxconfig.org
Scan Status Ok
Last Scan2025-06-12T04:45:34+00:00
Next Scan 2025-07-12T04:45:34+00:00

Last Scan

Scanned2025-06-12T04:45:34+00:00
URL https://forum.linuxconfig.org/robots.txt
Domain IPs 172.66.40.244, 172.66.43.12, 2606:4700:3108::ac42:28f4, 2606:4700:3108::ac42:2b0c
Response IP 172.66.43.12
Found Yes
Hash bcd5301db5893bb64acf82c062119519fbb8d847a6ddd82bf11acae8c8e3544c
SimHash 089d1dc577f1

Groups

mauibot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

seo spider

Rule Path
Disallow /

*

Rule Path
Disallow /admin/
Disallow /auth/
Disallow /assets/browser-update*.js
Disallow /email/
Disallow /session
Disallow /user-api-key
Disallow /*?api_key*
Disallow /*?*api_key*
Disallow /badges
Disallow /my
Disallow /search
Disallow /tag/*/l
Disallow /g
Disallow /t/*/*.rss
Disallow /c/*.rss

googlebot

Rule Path
Disallow /admin/
Disallow /auth/
Disallow /assets/browser-update*.js
Disallow /email/
Disallow /session
Disallow /user-api-key
Disallow /*?api_key*
Disallow /*?*api_key*

Other Records

Field Value
sitemap https://forum.linuxconfig.org/sitemap.xml

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file