01com.com
robots.txt

Robots Exclusion Standard data for 01com.com

Resource Scan

Scan Details

Site Domain 01com.com
Base Domain 01com.com
Scan Status Ok
Last Scan2024-05-21T22:16:10+00:00
Next Scan 2024-06-20T22:16:10+00:00

Last Scan

Scanned2024-05-21T22:16:10+00:00
URL https://01com.com/robots.txt
Redirect https://www.01com.com/robots.txt
Redirect Domain www.01com.com
Redirect Base 01com.com
Domain IPs 18.211.129.93, 34.195.39.85
Redirect IPs 18.211.129.93, 34.195.39.85
Response IP 34.195.39.85
Found Yes
Hash c63932ec5e6732f7fd7f01e35a013448a4d1d3d6d727e3d253b7fd06c268623c
SimHash ab75d4f04319

Groups

*

Rule Path
Disallow /administrator/
Disallow /cache/
Disallow /components/
Disallow /images/
Disallow /installation/
Disallow /language/
Disallow /libraries/
Disallow /media/
Disallow /modules/
Disallow /plugins/
Disallow /templates/
Disallow /tmp/
Disallow /xmlrpc/
Disallow /blog/
Disallow /01com/
Disallow /meeting/webhelp/
Disallow /*.swf$

browsershots

Rule Path
Disallow /

titan

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

Other Records

Field Value
sitemap http://www.01com.com/sitemap.xml