cuocazza.com
robots.txt

Robots Exclusion Standard data for cuocazza.com

Resource Scan

Scan Details

Site Domain cuocazza.com
Base Domain cuocazza.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-05-11T22:52:58+00:00
Next Scan 2025-08-09T22:52:58+00:00

Last Successful Scan

Scanned2023-10-20T19:37:12+00:00
URL https://cuocazza.com/robots.txt
Domain IPs 104.21.33.43, 172.67.141.41, 2606:4700:3030::ac43:8d29, 2606:4700:3037::6815:212b
Response IP 172.67.141.41
Found Yes
Hash bbb39d52e2de9ee754a39d695000f5ea84cee27b2e69a55d7ccf945081dc6292
SimHash 65105050c391

Groups

*

Rule Path
Disallow /404
Disallow /data-deletion
Disallow /logout
Disallow /goto
Disallow /goto/
Disallow /search-article
Disallow /search
Disallow /tim-kiem/
Disallow /tim-kiem-truyen
Disallow /w/
Disallow /wp-content/
Disallow /sw.js

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

yandex

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /