linuxathome.net
robots.txt

Robots Exclusion Standard data for linuxathome.net

Resource Scan

Scan Details

Site Domain linuxathome.net
Base Domain linuxathome.net
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2025-09-09T21:42:49+00:00
Next Scan 2025-10-09T21:42:49+00:00

Last Successful Scan

Scanned2025-07-19T06:47:30+00:00
URL https://linuxathome.net/robots.txt
Domain IPs 110.232.143.61, 2400:b800:3:1::6d
Response IP 110.232.143.61
Found Yes
Hash b8f4ba8224ba163670fc13539392f732cb9a051fc7184e91f6eeb65eea088e73
SimHash 609e184124e2

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /files
Disallow /gallery2
Disallow /openads
Disallow /poll
Disallow /rdf
Disallow /scripts
Disallow /shoutbox
Disallow /sysinfo
Disallow /wap

Other Records

Field Value
sitemap http://cdn.attracta.com/sitemap/2733180.xml.gz

Comments

  • /robots.txt file for http://linuxathome.net/
  • mail webmaster@linuxathome.net for constructive criticism
  • Begin Attracta SEO Tools Sitemap. Do not remove
  • End Attracta SEO Tools Sitemap. Do not remove