inetspec.net
robots.txt
Robots Exclusion Standard data for inetspec.net
Resource Scan
Scan Details
Site Domain | inetspec.net |
Base Domain | inetspec.net |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2024-08-01T06:51:52+00:00 |
Next Scan | 2024-10-30T06:51:52+00:00 |
Last Successful Scan
Scanned | 2022-12-16T01:33:16+00:00 |
URL | http://inetspec.net/robots.txt |
Redirect | https://inetspec.com/robots.txt |
Redirect Domain | inetspec.com |
Redirect Base | inetspec.com |
Domain IPs | 66.228.138.150 |
Redirect IPs | 66.228.138.150 |
Response IP | 66.228.138.150 |
Found | Yes |
Hash | 8cfd95f889ccbb6c3a66115c018020b305805f94a7e28cb85e9653dc38be8a2f |
SimHash | 241679c747f4 |
Groups
*
Rule | Path |
---|---|
Disallow | / |
butterfly
charlotte
exabot
envolk
gigabot
scoutjet
speedy
teoma
turnitinbot
twiceler
yowedobot
mj12bot
Rule | Path |
---|---|
Disallow | |
Disallow | /adm |
Disallow | /php |
Disallow | /tmp |
Disallow | /upl |
Disallow | /index.php |
Other Records
Field | Value |
---|---|
crawl-delay | 20 |
ia_archiver
Rule | Path |
---|---|
Disallow | |
Disallow | /adm |
Disallow | /php |
Disallow | /tmp |
Disallow | /upl |
Disallow | /index.php |
adsbot-google
googlebot
Rule | Path |
---|---|
Disallow | /files |
Disallow | /adm |
Disallow | /php |
Disallow | /tmp |
Disallow | /uploads |
Disallow | /index.php |
Allow | / |
Other Records
Field | Value |
---|---|
sitemap | http://www.inetspec.com/sitemap |
sitemap | http://www.inetspec.com/sitemap.xml |
Comments