nucleo.jor.br
robots.txt

Robots Exclusion Standard data for nucleo.jor.br

Resource Scan

Scan Details

Site Domain nucleo.jor.br
Base Domain nucleo.jor.br
Scan Status Ok
Last Scan2025-11-15T18:27:23+00:00
Next Scan 2025-11-16T18:27:23+00:00

Last Scan

Scanned2025-11-15T18:27:23+00:00
URL https://nucleo.jor.br/robots.txt
Domain IPs 104.21.96.40, 172.67.172.70, 2606:4700:3032::6815:6028, 2606:4700:3033::ac43:ac46
Response IP 172.67.172.70
Found Yes
Hash ebb1e4292e340aac1ee3614c491923aaade0acb486a4003e29aaf617355e902d
SimHash 650c4b01d2c4

Groups

*

Rule Path
Disallow /ghost/
Disallow /email/
Disallow /members/api/comments/counts/
Disallow /r/
Disallow /webmentions/receive/

googlebot

Rule Path
Allow /

ai2bot

Rule Path
Disallow /

ai2bot-dolma

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

andibot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

applebot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bedrockbot

Rule Path
Disallow /

brightbot 1.0

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claude-searchbot

Rule Path
Disallow /

claude-user

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

cohere-training-data-crawler

Rule Path
Disallow /

cotoyogi

Rule Path
Disallow /

crawlspace

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

duckassistbot

Rule Path
Disallow /

echoboxbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

factset_spyderbot

Rule Path
Disallow /

firecrawlagent

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

google-cloudvertexbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

googleother

Rule Path
Disallow /

googleother-image

Rule Path
Disallow /

googleother-video

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

iaskspider/2.0

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

isscyberriskcrawler

Rule Path
Disallow /

kangaroo bot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

mistralai-user/1.0

Rule Path
Disallow /

novaact

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

operator

Rule Path
Disallow /

pangubot

Rule Path
Disallow /

panscient

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

perplexity-user

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

phindbot

Rule Path
Disallow /

qualifiedbot

Rule Path
Disallow /

quillbot

Rule Path
Disallow /

quillbot.com

Rule Path
Disallow /

sbintuitionsbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

semrushbot-ocob

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

sidetrade indexer bot

Rule Path
Disallow /

tiktokspider

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

wpbot

Rule Path
Disallow /

yandexadditional

Rule Path
Disallow /

yandexadditionalbot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://nucleo.jor.br/sitemap.xml

Comments

  • Allow rules
  • Disallow rules
  • robots.txt generated by Robots.txt Helper