suivi-des-incidents.orange.fr
robots.txt

Robots Exclusion Standard data for suivi-des-incidents.orange.fr

Resource Scan

Scan Details

Site Domain suivi-des-incidents.orange.fr
Base Domain orange.fr
Scan Status Ok
Last Scan2024-11-03T07:32:05+00:00
Next Scan 2024-12-03T07:32:05+00:00

Last Scan

Scanned2024-11-03T07:32:05+00:00
URL https://suivi-des-incidents.orange.fr/robots.txt
Domain IPs 193.252.148.217
Response IP 193.252.122.104
Found Yes
Hash f87b0f5e94fac7aadfc503908dc6f2e5b6891f4345c525ff1e913ca473bc9a4e
SimHash b8d973c8cf33

Groups

*

Rule Path
Disallow /indisponible
Disallow /service-indisponible
Disallow /*?*

*

Rule Path
Disallow /*utm_

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

fetch

Rule Path
Disallow /

httrack

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webzip

Rule Path
Disallow /

wget

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

doc

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

fast

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

linko

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

npbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

xenu

Rule Path
Disallow /

zao

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

Other Records

Field Value
sitemap https://suivi-des-incidents.orange.fr/sitemap.xml

Comments

  • Notice:
  • Crawling suivi-des-incidents.orange.fr is prohibited unless you have express permission or are a
  • legitimate public search engine crawler under normal conditions of use
  • ---
  • Block indexing Url for UTM parameters
  • ---
  • ---
  • Unwanted crawlers
  • ---
  • ---
  • Unwanted scrappers
  • ---
  • ---
  • Unwanted other bots
  • ---