willethfarm.com
robots.txt

Robots Exclusion Standard data for willethfarm.com

Resource Scan

Scan Details

Site Domain willethfarm.com
Base Domain willethfarm.com
Scan Status Ok
Last Scan2024-05-31T01:32:25+00:00
Next Scan 2024-06-07T01:32:25+00:00

Last Scan

Scanned2024-05-31T01:32:25+00:00
URL https://willethfarm.com/robots.txt
Redirect http://www.willethfarm.com/robots.txt
Redirect Domain www.willethfarm.com
Redirect Base willethfarm.com
Domain IPs 104.21.56.120, 172.67.150.213, 2606:4700:3032::ac43:96d5, 2606:4700:3034::6815:3878
Redirect IPs 104.21.56.120, 172.67.150.213, 2606:4700:3032::ac43:96d5, 2606:4700:3034::6815:3878
Response IP 172.67.150.213
Found Yes
Hash f91706d2237aa9d4a7558865c5bc32fac5ee7708b488294977875f07848635fb
SimHash 75850ddd8d11

Groups

*

Rule Path
Disallow /m-login
Disallow /ce_make.html
Disallow /cyproductlist_1/
Disallow /verify.html

baiduspider/2.0

Rule Path
Disallow /producer/
Disallow /thirdcode/
Disallow /nportal/
Disallow /icp
Disallow *api*
Disallow *.php
Disallow *.asp
Disallow /admin/
Disallow /npublic/
Disallow /thirdcode/
Disallow /site.txt

Other Records

Field Value
sitemap /sitemap.xml