printnotion.com
robots.txt

Robots Exclusion Standard data for printnotion.com

Resource Scan

Scan Details

Site Domain printnotion.com
Base Domain printnotion.com
Scan Status Ok
Last Scan 2025-08-28T23:10:32+00:00
Next Scan 2025-09-27T23:10:32+00:00

Last Scan

Scanned 2025-08-28T23:10:32+00:00
URL https://printnotion.com/robots.txt
Domain IPs 104.21.22.235, 172.67.207.150, 2606:4700:3032::ac43:cf96, 2606:4700:3033::6815:16eb
Response IP 104.21.22.235
Found Yes
Hash f674ea7704f4fa8be7d31d68f9007768d9811593834e1be7f73941a0c7039491
SimHash c9228ed4c335
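
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest. The sketch below assumes (this is not confirmed by the report) that the digest is taken over the raw bytes of the robots.txt response body, and shows how such a fingerprint could be used to detect a change between scans. The function names are hypothetical.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body.

    Assumption: the scanner hashes the raw, unmodified bytes.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """Compare a freshly fetched body against a previously recorded digest."""
    return content_fingerprint(body) != previous_hash.lower()
```

A scanner could store the digest from each scan and refetch the parsed rules only when `has_changed` returns True.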

Groups

*

Rule Path
Allow /
Disallow /extension/
Disallow /payment/
Disallow *.js$
Disallow *.css$
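
As a rough illustration of how the rules above combine, the following sketch evaluates a URL path against them using the longest-match semantics described in RFC 9309: the matching rule with the longest pattern wins, and Allow wins a tie. The `*` wildcard and the `$` end anchor are handled as that RFC describes. The rule list is copied from the table above; the function names and sample paths are hypothetical, and a given crawler may interpret these rules differently.

```python
import re

# Rules for the "*" group, copied from the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "/extension/"),
    ("disallow", "/payment/"),
    ("disallow", "*.js$"),
    ("disallow", "*.css$"),
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if the path may be crawled under longest-match rules."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

Under these assumptions, `is_allowed("/payment/checkout")` and `is_allowed("/static/app.js")` are both False, while `is_allowed("/static/app.js?v=2")` is True because the `$` anchor requires the path to end in `.js`.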

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.printnotion.com/sitemap.xml

Comments

  • Disallow crawling of sensitive directories
  • Crawl delay (optional)
  • Sitemap location