espace-client.orange.fr
robots.txt

Robots Exclusion Standard data for espace-client.orange.fr

Resource Scan

Scan Details

Site Domain espace-client.orange.fr
Base Domain orange.fr
Scan Status Ok
Last Scan2024-11-14T07:16:34+00:00
Next Scan 2024-11-28T07:16:34+00:00

Last Scan

Scanned2024-11-14T07:16:34+00:00
URL https://espace-client.orange.fr/robots.txt
Domain IPs 193.252.149.163
Response IP 193.252.117.230
Found Yes
Hash bd31decb224081f3468945cb743401959139314ea3b38c6a6004767d76622a2e
SimHash f9d973c1cd33

Groups

*

Rule Path
Disallow /indisponible
Disallow /service-indisponible
Disallow /rechargement-indisponible
Disallow /*?*

*

Rule Path
Disallow /*utm_

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

fetch

Rule Path
Disallow /

httrack

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webzip

Rule Path
Disallow /

wget

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

doc

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

fast

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

linko

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

npbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

xenu

Rule Path
Disallow /

zao

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

Other Records

Field Value
sitemap https://espace-client.orange.fr/sitemap.xml

Comments

  • Notice:
  • Crawling espace-client.orange.fr is prohibited unless you have express permission or are a
  • legitimate public search engine crawler under normal conditions of use
  • ---
  • Block indexing Url for UTM parameters
  • ---
  • ---
  • Unwanted crawlers
  • ---
  • ---
  • Unwanted scrappers
  • ---
  • ---
  • Unwanted other bots
  • ---