pspdfkit.com
robots.txt

Robots Exclusion Standard data for pspdfkit.com

Resource Scan

Scan Details

Site Domain pspdfkit.com
Base Domain pspdfkit.com
Scan Status Ok
Last Scan 2025-10-13T15:24:02+00:00
Next Scan 2025-11-12T15:24:02+00:00

Last Scan

Scanned 2025-10-13T15:24:02+00:00
URL https://pspdfkit.com/robots.txt
Redirect https://www.nutrient.io/robots.txt
Redirect Domain www.nutrient.io
Redirect Base nutrient.io
Domain IPs 172.66.40.179, 172.66.43.77, 2606:4700:3108::ac42:28b3, 2606:4700:3108::ac42:2b4d
Redirect IPs 172.66.40.122, 172.66.43.134, 2606:4700:3108::ac42:287a, 2606:4700:3108::ac42:2b86
Response IP 172.66.43.134
Found Yes
Hash e2b7189cdb3d6d59735420e5fe9eeb05bde27fb99b3898975797d75efce7adf8
SimHash 39954911e942

Groups

oai-searchbot

Rule Path
Disallow /*.pdf$

chatgpt-user

Rule Path
Disallow /*.pdf$

gptbot

Rule Path
Disallow /*.pdf$

claudebot

Rule Path
Disallow /*.pdf$

claude-searchbot

Rule Path
Disallow /*.pdf$

claude-user

Rule Path
Disallow /*.pdf$

perplexitybot

Rule Path
Disallow /*.pdf$

google-extended

Rule Path
Disallow /*.pdf$

applebot-extended

Rule Path
Disallow /*.pdf$

meta-externalagent

Rule Path
Disallow /*.pdf$

amazonbot

Rule Path
Disallow /*.pdf$

ccbot

Rule Path
Disallow /*.pdf$

kagibot

Rule Path
Disallow /*.pdf$

duckassistbot

Rule Path
Disallow /*.pdf$

youbot

Rule Path
Disallow /*.pdf$

*

Rule Path
Disallow /*.pdf$
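
Every group above carries the same rule, Disallow /*.pdf$. As a rough illustration of how such a pattern is usually read (following the wildcard conventions of RFC 9309, where * matches any run of characters and a trailing $ anchors the end of the URL path), the sketch below converts a rule path to a regular expression and tests a few paths. The function names and the sample paths are hypothetical, chosen only for this example; real crawlers may differ in details such as percent-encoding and query-string handling.

```python
import re


def robots_pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt rule path into a compiled regex.

    Assumes RFC 9309-style wildcards: '*' matches any sequence of
    characters and a trailing '$' anchors the match at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    # Without '$' the rule is a prefix match, so the regex is left open-ended.
    return re.compile("^" + body + (r"\Z" if anchored else ""))


def is_disallowed(path: str, pattern: str = "/*.pdf$") -> bool:
    """Return True if the path (plus any query string) matches the rule."""
    return robots_pattern_to_regex(pattern).match(path) is not None


# Hypothetical sample paths, not taken from the scanned site.
for sample in ["/guides/whitepaper.pdf", "/guides/whitepaper.pdf?download=1", "/pdf-sdk/"]:
    print(sample, is_disallowed(sample))
```

Under this reading, only URLs whose path ends in .pdf are blocked; a PDF URL with a query string appended, or a page that merely contains "pdf" in its path, is not matched by the rule.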

Other Records

Field Value
sitemap https://www.nutrient.io/sitemap.xml
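
For readers who want to see the scan data in its original file form, the sketch below renders the groups and the sitemap record back into robots.txt syntax. It is an approximation only: the scan lists user-agent tokens in lowercase and one group per token, so the casing, grouping, ordering, and any comments of the actual file at https://www.nutrient.io/robots.txt are assumptions here. The function name render_robots_txt is hypothetical.

```python
def render_robots_txt(groups, sitemap):
    """Render (user_agent, [(rule, path), ...]) groups as robots.txt text."""
    lines = []
    for agent, rules in groups:
        lines.append(f"User-agent: {agent}")
        for rule, path in rules:
            lines.append(f"{rule}: {path}")
        lines.append("")  # blank line between groups
    lines.append(f"Sitemap: {sitemap}")
    return "\n".join(lines) + "\n"


# User-agent tokens exactly as listed in the scan (lowercased by the scanner).
AGENTS = [
    "oai-searchbot", "chatgpt-user", "gptbot", "claudebot",
    "claude-searchbot", "claude-user", "perplexitybot", "google-extended",
    "applebot-extended", "meta-externalagent", "amazonbot", "ccbot",
    "kagibot", "duckassistbot", "youbot", "*",
]

# Each group in the scan has the single rule: Disallow /*.pdf$
groups = [(agent, [("Disallow", "/*.pdf$")]) for agent in AGENTS]
text = render_robots_txt(groups, "https://www.nutrient.io/sitemap.xml")
print(text)
```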