integrify.com
robots.txt

Robots Exclusion Standard data for integrify.com

Resource Scan

Scan Details

Site Domain integrify.com
Base Domain integrify.com
Scan Status Ok
Last Scan 2025-10-14T06:53:44+00:00
Next Scan 2025-11-13T06:53:44+00:00

Last Scan

Scanned 2025-10-14T06:53:44+00:00
URL https://integrify.com/robots.txt
Redirect https://www.nutrient.io/robots.txt
Redirect Domain www.nutrient.io
Redirect Base nutrient.io
Domain IPs 34.248.245.69, 52.211.88.244, 54.217.251.99
Redirect IPs 172.66.40.122, 172.66.43.134, 2606:4700:3108::ac42:287a, 2606:4700:3108::ac42:2b86
Response IP 172.66.40.122
Found Yes
Hash e2b7189cdb3d6d59735420e5fe9eeb05bde27fb99b3898975797d75efce7adf8
SimHash 39954911e942
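
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. Assuming it is SHA-256 over the raw response body, a check against a locally saved copy of the file might look like the sketch below. The helper names sha256_hex and matches_reported_hash are hypothetical.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes, reported: str) -> bool:
    """Compare a body's digest with a reported hex digest.

    Assumption: the reported value is SHA-256 over the raw body.
    The scan report itself does not confirm this.
    """
    return sha256_hex(body) == reported.strip().lower()
```

A mismatch would not by itself mean the file changed: the scanner may normalise line endings or hash something other than the raw body.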

Groups

oai-searchbot

Rule Path
Disallow /*.pdf$

chatgpt-user

Rule Path
Disallow /*.pdf$

gptbot

Rule Path
Disallow /*.pdf$

claudebot

Rule Path
Disallow /*.pdf$

claude-searchbot

Rule Path
Disallow /*.pdf$

claude-user

Rule Path
Disallow /*.pdf$

perplexitybot

Rule Path
Disallow /*.pdf$

google-extended

Rule Path
Disallow /*.pdf$

applebot-extended

Rule Path
Disallow /*.pdf$

meta-externalagent

Rule Path
Disallow /*.pdf$

amazonbot

Rule Path
Disallow /*.pdf$

ccbot

Rule Path
Disallow /*.pdf$

kagibot

Rule Path
Disallow /*.pdf$

duckassistbot

Rule Path
Disallow /*.pdf$

youbot

Rule Path
Disallow /*.pdf$

*

Rule Path
Disallow /*.pdf$
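
Every group above carries the same rule, Disallow /*.pdf$. As a rough illustration of how such a pattern is commonly interpreted ('*' matches any run of characters and a trailing '$' anchors the end of the URL path, as described in RFC 9309), the Python sketch below converts a rule path into a regular expression. The function names rule_to_regex and is_disallowed are hypothetical, and the sample paths are invented for illustration, not taken from the scanned site.

```python
import re


def rule_to_regex(rule_path: str) -> "re.Pattern[str]":
    """Translate a robots.txt rule path into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path.
    Without '$' the rule is treated as a prefix match.
    """
    anchored = rule_path.endswith("$")
    body = rule_path[:-1] if anchored else rule_path
    # Escape literal pieces, then rejoin them with a wildcard.
    pattern = ".*".join(re.escape(piece) for piece in body.split("*"))
    if anchored:
        pattern += r"\Z"
    return re.compile(pattern)


def is_disallowed(url_path: str, rule_path: str = "/*.pdf$") -> bool:
    """Return True if url_path (path plus any query string) matches the rule."""
    return rule_to_regex(rule_path).match(url_path) is not None


if __name__ == "__main__":
    for sample in ("/guides/manual.pdf",
                   "/guides/manual.pdf?download=1",
                   "/guides/manual.pdf.html",
                   "/pdf-sdk/"):
        print(sample, is_disallowed(sample))
```

Under this reading, only paths that end in .pdf are blocked; a PDF URL with a query string appended falls outside the rule because of the '$' anchor.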

Other Records

Field Value
sitemap https://www.nutrient.io/sitemap.xml