photobookchina.com
robots.txt

Robots Exclusion Standard data for photobookchina.com

Resource Scan

Scan Details

Site Domain photobookchina.com
Base Domain photobookchina.com
Scan Status Ok
Last Scan2024-09-17T08:39:22+00:00
Next Scan 2024-10-17T08:39:22+00:00

Last Scan

Scanned2024-09-17T08:39:22+00:00
URL https://photobookchina.com/robots.txt
Redirect https://www.photobookchina.com/robots.txt
Redirect Domain www.photobookchina.com
Redirect Base photobookchina.com
Domain IPs 13.227.254.112, 13.227.254.116, 13.227.254.84, 13.227.254.90, 2600:9000:200a:2800:4:ca7c:d200:93a1, 2600:9000:200a:6c00:4:ca7c:d200:93a1, 2600:9000:200a:8600:4:ca7c:d200:93a1, 2600:9000:200a:9600:4:ca7c:d200:93a1, 2600:9000:200a:aa00:4:ca7c:d200:93a1, 2600:9000:200a:c400:4:ca7c:d200:93a1, 2600:9000:200a:e400:4:ca7c:d200:93a1, 2600:9000:200a:e600:4:ca7c:d200:93a1
Redirect IPs 13.227.254.112, 13.227.254.116, 13.227.254.84, 13.227.254.90, 2600:9000:200a:5400:4:ca7c:d200:93a1, 2600:9000:200a:6800:4:ca7c:d200:93a1, 2600:9000:200a:7400:4:ca7c:d200:93a1, 2600:9000:200a:8000:4:ca7c:d200:93a1, 2600:9000:200a:8800:4:ca7c:d200:93a1, 2600:9000:200a:9600:4:ca7c:d200:93a1, 2600:9000:200a:c200:4:ca7c:d200:93a1, 2600:9000:200a:dc00:4:ca7c:d200:93a1
Response IP 13.227.254.112
Found Yes
Hash b6d4968aabaebf393dab4236866219bf40c272c87b8715dcecae86c710718c92
SimHash 180c91a02e90

Groups

semrushbot

Rule Path
Disallow /

*

Rule Path
Disallow /checkout/*
Disallow /account/*
Disallow /search/
Disallow */editor
Disallow *?sort=
Disallow *sentry.getprintbox.com/*

Other Records

Field Value
sitemap https://pbww-ap-prod.s3.amazonaws.com/sitemap/sitemap_cn.xml

Comments

  • Crawlers Setup
  • Directories
  • Paths (clean URLs)
  • Paths (no clean URLs)
  • Ajax