son-heung-min-cz.com
robots.txt

Robots Exclusion Standard data for son-heung-min-cz.com

Resource Scan

Scan Details

Site Domain son-heung-min-cz.com
Base Domain son-heung-min-cz.com
Scan Status Ok
Last Scan2024-10-18T01:57:49+00:00
Next Scan 2024-11-17T01:57:49+00:00

Last Scan

Scanned2024-10-18T01:57:49+00:00
URL https://son-heung-min-cz.com/robots.txt
Domain IPs 104.21.49.132, 172.67.163.124, 2606:4700:3034::6815:3184, 2606:4700:3037::ac43:a37c
Response IP 104.21.49.132
Found Yes
Hash 8d6706d6c486a0775864bdc90169b5d1d972de1e433adb5473e7e6fb7bb3c3a5
SimHash 6f30b8604fb2

Groups

*

Rule Path
Disallow /cgi-bin
Allow /wp-admin/admin-ajax.php
Disallow /?
Disallow *?s=
Disallow *%26s%3D
Disallow /search
Disallow /author/
Disallow */embed$
Disallow */xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D

Other Records

Field Value
sitemap https://son-heung-min-cz.com/sitemap.xml