puremedias.com
robots.txt

Robots Exclusion Standard data for puremedias.com

Resource Scan

Scan Details

Site Domain puremedias.com
Base Domain puremedias.com
Scan Status Ok
Last Scan 2026-01-17T23:31:34+00:00
Next Scan 2026-01-24T23:31:34+00:00

Last Scan

Scanned 2026-01-17T23:31:34+00:00
URL https://puremedias.com/robots.txt
Redirect https://www.ozap.com/robots.txt
Redirect Domain www.ozap.com
Redirect Base ozap.com
Domain IPs 217.70.184.55
Redirect IPs 104.18.36.160, 172.64.151.96
Response IP 104.18.36.160
Found Yes
Hash 39b55aae17a55906a0936f5c0cd6ee15d703b3870ca916a4a7dff1ac943b770d
SimHash 289422c2c052
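
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. Assuming it is SHA-256 over the raw response body, a change check against the recorded value could be sketched as follows. The helper names `fingerprint` and `has_changed` are illustrative and not part of any scanner API.

```python
import hashlib

# Hash value copied from the scan report. Treating it as SHA-256 of the raw
# robots.txt body is an assumption; the report does not state the algorithm.
RECORDED_HASH = "39b55aae17a55906a0936f5c0cd6ee15d703b3870ca916a4a7dff1ac943b770d"


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Report whether a freshly fetched body differs from the recorded hash."""
    return fingerprint(body) != recorded
```

An exact hash only answers "identical or not". The separate SimHash field suggests the scanner also tracks near-duplicate changes, which this sketch does not attempt to reproduce.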

Groups

*

Rule Path
Disallow /recherche
Disallow /recherche/
Disallow /cdn-cgi/
Disallow /_/

arquivo-web-crawler

Rule Path
Disallow /

ai2bot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatglm

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-training-data-crawler

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

omigilibot

Rule Path
Disallow /

youbot

Rule Path
Disallow /
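
The effect of the groups listed above can be checked with Python's standard `urllib.robotparser`. This is a minimal sketch: the robots.txt text is rebuilt from the scan output, so the ordering, the letter case of the agent names, and any comments in the live file are assumptions. The agent name `ExampleBot` and the path `/actu` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Agent names exactly as listed in the scan (lowercased by the report).
BLOCKED_AGENTS = [
    "arquivo-web-crawler", "ai2bot", "applebot-extended", "bytespider",
    "ccbot", "chatglm", "claudebot", "cohere-training-data-crawler",
    "diffbot", "gptbot", "meta-externalagent", "omigilibot", "youbot",
]

# Paths disallowed for every other crawler (the "*" group).
DEFAULT_DISALLOWS = ["/recherche", "/recherche/", "/cdn-cgi/", "/_/"]


def build_robots_txt() -> str:
    """Rebuild an approximate robots.txt from the groups in the scan."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in DEFAULT_DISALLOWS]
    lines.append("")
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    return "\n".join(lines)


parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())


def allowed(agent: str, url: str) -> bool:
    """Return True if the reconstructed rules let `agent` fetch `url`."""
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    base = "https://www.ozap.com"
    print(allowed("GPTBot", base + "/"))                 # named agent: blocked site-wide
    print(allowed("ExampleBot", base + "/actu"))         # falls under "*": allowed
    print(allowed("ExampleBot", base + "/recherche/x"))  # falls under "*": blocked
```

Agents with their own group are denied every path, while any other crawler falls back to the `*` group and is only kept out of the search, `/cdn-cgi/`, and `/_/` paths. Note that `urllib.robotparser` matches agent names by case-insensitive substring, which is one of several matching strategies crawlers use in practice.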