gpu.perigueux.u-bordeaux.fr
robots.txt

Robots Exclusion Standard data for gpu.perigueux.u-bordeaux.fr

Resource Scan

Scan Details

Site Domain gpu.perigueux.u-bordeaux.fr
Base Domain u-bordeaux.fr
Scan Status Ok
Last Scan2025-09-30T02:25:07+00:00
Next Scan 2025-10-30T02:25:07+00:00

Last Scan

Scanned2025-09-30T02:25:07+00:00
URL https://gpu.perigueux.u-bordeaux.fr/robots.txt
Domain IPs 147.210.181.87
Response IP 147.210.181.87
Found Yes
Hash 64d54144524e0c91fd39c2b19332a538463f2d1f4503800933186b5a6602b008
SimHash 749e0151c5f4

Groups

ai2bot

Rule Path
Disallow /

ai2bot-dolma

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

googleother

Rule Path
Disallow /

googleother-image

Rule Path
Disallow /

googleother-video

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

iaskspider/2.0

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

isscyberriskcrawler

Rule Path
Disallow /

kangaroo bot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

sidetrade indexer bot

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

yandex

Rule Path
Disallow /

youbot

Rule Path
Disallow /
Disallow /widgets/

Other Records

Field Value
sitemap https://gpu.perigueux.u-bordeaux.fr/sitemap.xml

Warnings

  • `utilisateur-agent` is not a known field.