guazi.com
robots.txt

Robots Exclusion Standard data for guazi.com

Resource Scan

Scan Details

Site Domain guazi.com
Base Domain guazi.com
Scan Status Ok
Last Scan2024-06-01T19:16:08+00:00
Next Scan 2024-07-01T19:16:08+00:00

Last Scan

Scanned2024-06-01T19:16:08+00:00
URL https://guazi.com/robots.txt
Redirect https://www.guazi.com/robots.txt
Redirect Domain www.guazi.com
Redirect Base guazi.com
Domain IPs 124.251.6.133
Redirect IPs 124.251.6.133
Response IP 124.251.6.133
Found Yes
Hash 14ac07fff5b9bb889bf885d42e10a0ae4e2e8b5851116769676b8f53a0c03a56
SimHash 200d515141b1

Groups

baiduspider

Rule Path
Allow /

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

yahoo! slurp

Rule Path
Allow /

yahoo! slurp china

Rule Path
Allow /

yahoo!-adcrawler

Rule Path
Allow /

youdaobot

Rule Path
Allow /

sosospider

Rule Path
Allow /

sogou spider

Rule Path
Allow /

msnbot

Rule Path
Allow /

ia_archiver

Rule Path
Allow /

*

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.guazi.com/sitemap.xml

Warnings

  • 2 invalid lines.