dailyworkhorse.com
robots.txt
Robots Exclusion Standard data for dailyworkhorse.com
Resource Scan
Scan Details
Site Domain | dailyworkhorse.com |
Base Domain | dailyworkhorse.com |
Scan Status | Ok |
Last Scan | 2024-09-25T15:40:23+00:00 |
Next Scan | 2024-10-02T15:40:23+00:00 |
Last Scan
Scanned | 2024-09-25T15:40:23+00:00 |
URL | https://dailyworkhorse.com/robots.txt |
Domain IPs | 104.21.95.103, 172.67.144.73, 2606:4700:3031::ac43:9049, 2606:4700:3036::6815:5f67 |
Response IP | 172.67.144.73 |
Found | Yes |
Hash | e359638a27615cb8ef73739a1f046324e0e02379b54b12920971b6ef8f91c041 |
SimHash | 49cf4c40ec83 |
Groups
googlebot
Rule | Path |
---|---|
Disallow | /*/trackback |
Disallow | /*/feed |
Disallow | /*/comments |
Disallow | /*?* |
Disallow | /*? |
Disallow | /*page/* |
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /wp-admin/ |
Disallow | /wp-includes/ |
Disallow | /wp-content/plugins/ |
Disallow | /wp-content/themes/ |
Disallow | /trackback |
Disallow | /comments |
Disallow | /feed |
Other Records
Field | Value |
---|---|
sitemap | http://www.dailyworkhorse.com/sitemap.xml |