telephoneannuaire.fr
robots.txt

Robots Exclusion Standard data for telephoneannuaire.fr

Resource Scan

Scan Details

Site Domain telephoneannuaire.fr
Base Domain telephoneannuaire.fr
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-11-04T02:46:41+00:00
Next Scan 2024-11-18T02:46:41+00:00

Last Successful Scan

Scanned 2024-10-19T20:02:33+00:00
URL https://telephoneannuaire.fr/robots.txt
Redirect https://d3mydwg980rlym.cloudfront.net/0e0d5d18-cd42-4267-b1fc-652570c535f9/robots.txt
Redirect Domain d3mydwg980rlym.cloudfront.net
Redirect Base d3mydwg980rlym.cloudfront.net
Domain IPs 13.35.238.121, 13.35.238.44, 13.35.238.54, 13.35.238.67
Redirect IPs 108.156.139.202, 108.156.139.218, 108.156.139.37, 108.156.139.70, 2600:9000:2755:1600:11:8615:fb80:21, 2600:9000:2755:2800:11:8615:fb80:21, 2600:9000:2755:4600:11:8615:fb80:21, 2600:9000:2755:6800:11:8615:fb80:21, 2600:9000:2755:6a00:11:8615:fb80:21, 2600:9000:2755:8e00:11:8615:fb80:21, 2600:9000:2755:a600:11:8615:fb80:21, 2600:9000:2755:b000:11:8615:fb80:21
Response IP 108.156.139.37
Found Yes
Hash c035390546a462b22427d7a7894541d6699a601c7caff646ab3d4152b090f8c8
SimHash db6ec740f432

Groups

*

Rule Path
Disallow /gdpr

mediapartners-google

Rule Path
Allow /

bl.uk_lddc_bot

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

addthis

Rule Path
Disallow /

addthis.com

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

affectv robot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

b2w/0.1

Rule Path
Disallow /

bitlybot

Rule Path
Disallow /

bizzinformation

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

butterfly

Rule Path
Disallow /

careerbot

Rule Path
Disallow /

changedetection

Rule Path
Disallow /

cheesebot

Rule Path
Disallow /

cherrypicker

Rule Path
Disallow /

coccoc

Rule Path
Disallow /

compspybot

Rule Path
Disallow /

copyrightcheck

Rule Path
Disallow /

crescent

Rule Path
Disallow /

discoverybot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotnetdotcom

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

exabot

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

fast enterprise crawler

Rule Path
Disallow /

genieo

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

grub

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

ichiro

Rule Path
Disallow /

integromedb

Rule Path
Disallow /

jetbot/1.0

Rule Path
Disallow /

larbin

Rule Path
Disallow /

linguatools

Rule Path
Disallow /

lipperhey

Rule Path
Disallow /

lnspiderguy

Rule Path
Disallow /

looksmart

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

miixpc

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

moget/2.1

Rule Path
Disallow /

muscat ferret

Rule Path
Disallow /

netmechanic

Rule Path
Disallow /

phantom.js bot

Rule Path
Disallow /

pixray-seeker

Rule Path
Disallow /

prowebwalker

Rule Path
Disallow /

proximic

Rule Path
Disallow /

python-urllib

Rule Path
Disallow /

purebot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

seomoz

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

snoopy

Rule Path
Disallow /

sogou

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

solomonobot

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

spbot

Rule Path
Disallow /

spiderjack

Rule Path
Disallow /

speedy

Rule Path
Disallow /

stalker

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

teleport

Rule Path
Disallow /

telesoft

Rule Path
Disallow /

thesubot

Rule Path
Disallow /

thumbshots-de-bot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

unisterbot

Rule Path
Disallow /

url_spider_pro

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

websauger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

website quester

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webzip/4.0

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

yacybot

Rule Path
Disallow /

yamanalab-robot

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

zmeu

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

spiderling

Rule Path
Disallow /

webcrawler

Rule Path
Disallow /

arquivo-web-crawler

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

idg/it

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

admantx

Rule Path
Disallow /

mixrankbot

Rule Path
Disallow /

findxbot

Rule Path
Disallow /

yetibot

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

metajobbot

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

lcc

Rule Path
Disallow /

bdcbot

Rule Path
Disallow /

domaincrawler

Rule Path
Disallow /

domaincrawler/3.0

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

domainappender

Rule Path
Disallow /

wiederfreibot

Rule Path
Disallow /

wiederfreibot/1.0

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

llc

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

riddler

Rule Path
Disallow /

deusu

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

ips-agent

Rule Path
Disallow /

garlikcrawler

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

hybridbot

Rule Path
Disallow /

getintent crawler

Rule Path
Disallow /

ias_crawler

Rule Path
Disallow /

nekstbot

Rule Path
Disallow /

linkfluence

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

obot

Rule Path
Disallow /

netseer

Rule Path
Disallow /

adbeat_bot

Rule Path
Disallow /

lightspeedsystemscrawler

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

i4ds-bot

Rule Path
Disallow /

eyeotabot/1.0

Rule Path
Disallow /

lumtelbot/1.0

Rule Path
Disallow /

ttd-content

Rule Path
Disallow /
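
As a rough illustration of how the groups listed above might be evaluated, the sketch below rebuilds a three-group subset (*, mediapartners-google, ahrefsbot) and queries it with Python's standard urllib.robotparser. The User-agent/Allow/Disallow lines are reconstructed from the parsed table, not copied from the original file, and the agent name SomeOtherBot is hypothetical. Other parsers may resolve group matching differently, so treat this as a sketch under those assumptions.

```python
from urllib.robotparser import RobotFileParser

# Subset of the groups above, rewritten in robots.txt syntax.
# Assumption: reconstructed from the scan table, not the original file.
ROBOTS_SUBSET = """\
User-agent: *
Disallow: /gdpr

User-agent: mediapartners-google
Allow: /

User-agent: ahrefsbot
Disallow: /
"""

BASE = "https://telephoneannuaire.fr"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_SUBSET)

# (user agent, path) pairs to check; SomeOtherBot is a hypothetical
# crawler that falls through to the "*" group.
checks = [
    ("AhrefsBot", "/"),
    ("Mediapartners-Google", "/gdpr"),
    ("SomeOtherBot", "/gdpr"),
    ("SomeOtherBot", "/"),
]

for agent, path in checks:
    allowed = parser.can_fetch(agent, BASE + path)
    print(f"{agent:22} {path:6} {'allowed' if allowed else 'blocked'}")
```

With this subset, a named group takes precedence over the * group, so mediapartners-google may fetch /gdpr while an unlisted crawler may not.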

Other Records

Field Value
sitemap https://www.telephoneannuaire.fr/sitemaps/2024-03-26/sitemap-index.xml

Comments

  • cinsky bot

Warnings

  • 4 invalid lines.