webhappy.net
robots.txt

Robots Exclusion Standard data for webhappy.net

Resource Scan

Scan Details

Site Domain webhappy.net
Base Domain webhappy.net
Scan Status Ok
Last Scan2025-10-20T21:48:18+00:00
Next Scan 2025-11-19T21:48:18+00:00

Last Scan

Scanned2025-10-20T21:48:18+00:00
URL https://webhappy.net/robots.txt
Redirect http://www.webhappy.net/robots.txt
Redirect Domain www.webhappy.net
Redirect Base webhappy.net
Domain IPs 104.21.52.101, 172.67.198.17, 2606:4700:3033::6815:3465, 2606:4700:3034::ac43:c611
Redirect IPs 104.21.52.101, 172.67.198.17, 2606:4700:3033::6815:3465, 2606:4700:3034::ac43:c611
Response IP 172.67.198.17
Found Yes
Hash 2bd8d795654b3daf1d149db0a40946605587fedbd8a4901da0107196e8bde7e9
SimHash 75858cd98d13

Groups

*

Rule Path
Disallow /m-login
Disallow /ce_make.html
Disallow /verify110manu.html
Disallow /ce_cust_403.html
Disallow /verify430manu.html
Disallow /cyproductlist_1/
Disallow /verify.html

baiduspider/2.0

Rule Path
Disallow /producer/
Disallow /thirdcode/
Disallow /nportal/
Disallow /icp
Disallow *api*
Disallow *.php
Disallow *.asp
Disallow /admin/
Disallow /npublic/
Disallow /site.txt

Other Records

Field Value
sitemap /sitemap.xml