mycleanunion.com
robots.txt

Robots Exclusion Standard data for mycleanunion.com

Resource Scan

Scan Details

Site Domain mycleanunion.com
Base Domain mycleanunion.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-10-02T05:10:15+00:00
Next Scan 2025-12-31T05:10:15+00:00

Last Successful Scan

Scanned 2024-08-16T00:54:02+00:00
URL https://mycleanunion.com/robots.txt
Domain IPs 104.21.62.254, 172.67.168.241, 2606:4700:3033::ac43:a8f1, 2606:4700:3034::6815:3efe
Response IP 172.67.168.241
Found Yes
Hash 44688b483a0388ef71e8edc06635304f5bf08b5b8fe61c59f6a1ee3ee7590497
SimHash 081edd60ea9b

Groups

baiduspider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://mycleanunion.com/sitemap.xml
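The groups and sitemap record above can be checked with a short sketch. It rebuilds a robots.txt from the listed rules and feeds it to Python's standard urllib.robotparser. The rebuilt file is an assumption: the original file's exact text, casing, and group order are not shown in this report, and the names BLOCKED_AGENTS and build_robots_txt are hypothetical helpers introduced only for illustration. The agent strings "AhrefsBot" and "Googlebot" are sample inputs, not values from the scan.

```python
from urllib.robotparser import RobotFileParser

# User agents listed in the report with "Disallow /" (hypothetical constant name).
BLOCKED_AGENTS = [
    "baiduspider", "ahrefsbot", "mj12bot", "blexbot",
    "dotbot", "semrushbot", "yandexbot",
]


def build_robots_txt():
    """Rebuild an approximate robots.txt from the groups in the report."""
    lines = []
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    # The "*" group has an empty Disallow value, which permits everything.
    lines += ["User-agent: *", "Disallow:", ""]
    lines.append("Sitemap: https://mycleanunion.com/sitemap.xml")
    return "\n".join(lines)


parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())

# A listed crawler is blocked from the whole site.
print(parser.can_fetch("AhrefsBot", "https://mycleanunion.com/"))
# Any crawler not listed falls through to the "*" group and is allowed.
print(parser.can_fetch("Googlebot", "https://mycleanunion.com/page"))
# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

Under these assumptions, the seven named crawlers are denied every path, while all other crawlers match the "*" group, whose empty Disallow imposes no restriction.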