revistas.marilia.unesp.br
robots.txt

Robots Exclusion Standard data for revistas.marilia.unesp.br

Resource Scan

Scan Details

Site Domain revistas.marilia.unesp.br
Base Domain unesp.br
Scan Status Ok
Last Scan4/2/2025, 3:52:33 PM
Next Scan 5/2/2025, 3:52:33 PM

Last Scan

Scanned4/2/2025, 3:52:33 PM
URL https://revistas.marilia.unesp.br/robots.txt
Domain IPs 200.145.171.113
Response IP 200.145.171.113
Found Yes
Hash 8bd94407ed08d18d3ab7eb619a22facb749a085aa969237e15898938f72302ac
SimHash 703e0901c1a6

Groups

ahrefsbot
ai2bot
ai2bot-dolma
amazonbot
anthropic-ai
applebot
applebot-extended
bytespider
ccbot
chatgpt
chatgpt-user
claude-web
claudebot
cohere-ai
diffbot
dorkbot
duckassistbot
facebookbot
facebookexternalhit
facebookcatalog
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
isscyberriskcrawler
kangaroo bot
mj12bot
meta-externalagent
meta-externalfetcher
oai-searchbot
omgili
omgilibot
pangubot
perplexitybot
petalbot
scrapy
sidetrade indexer bot
semrushbot
timpibot
twitterbot
velenpublicwebcrawler
webzio-extended
youbot

Rule Path
Disallow /

python-requests
python-urllib

Rule Path
Allow /index.php/index/oai
Disallow /

*

Rule Path
Disallow /cache/