parmapress24.it
robots.txt

Robots Exclusion Standard data for parmapress24.it

Resource Scan

Scan Details

Site Domain parmapress24.it
Base Domain parmapress24.it
Scan Status Ok
Last Scan2024-09-22T12:43:59+00:00
Next Scan 2024-09-29T12:43:59+00:00

Last Scan

Scanned2024-09-22T12:43:59+00:00
URL https://parmapress24.it/robots.txt
Redirect https://www.parmapress24.it/robots.txt
Redirect Domain www.parmapress24.it
Redirect Base parmapress24.it
Domain IPs 173.212.218.114
Redirect IPs 173.212.218.114
Response IP 173.212.218.114
Found Yes
Hash 974bb6aaf7e8d29c38ca2cd47b3d2abee4a3f073b61c4edffdeba3d6a876e223
SimHash 50107141c7a4

Groups

anthropic-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

youbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

paperlibot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

swiftbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

weborama

Rule Path
Disallow /

garlik

Rule Path
Disallow /

hypefactors

Rule Path
Disallow /

seekport

Rule Path
Disallow /

Comments

  • AI-Related Bots
  • General Crawlers and Indexing Bots