programme-television.org
robots.txt

Robots Exclusion Standard data for programme-television.org

Resource Scan

Scan Details

Site Domain programme-television.org
Base Domain programme-television.org
Scan Status Ok
Last Scan2024-06-21T22:40:27+00:00
Next Scan 2024-06-28T22:40:27+00:00

Last Scan

Scanned2024-06-21T22:40:27+00:00
URL https://programme-television.org/robots.txt
Domain IPs 3.163.158.122, 3.163.158.18, 3.163.158.23, 3.163.158.28
Response IP 18.165.171.117
Found Yes
Hash 2ea6272669858e14485cf4e171ab169cfec3d7b1104608e0a21d02491385a2e9
SimHash 2846d0400c17

Groups

*

Rule Path
Disallow /*escaped_fragment
Disallow /api
Disallow /recherche
Disallow /replay/moteur
Disallow /replay/redirection
Disallow /replay/replaylist
Disallow /Teasers/*
Disallow /newsletter/

meltawer
digimind
knowings
sindup
cision
talkwater
turnitinbot
converacrawler
jetbot
newsnow
kbcrawl
amisoftware
newzbin
ask n read
qwam content intelligence
zite
youmag
synthesio
trendybuzz
spotter
scoop.it
linkfluence
augure
corporama
grub-client
k2spider
libwww
wget
adequat
adequat-systems
auramundi
coexel
ellisphere
leadbox
mention
moreover
mytwip
opinion-tracker
proxem
score3
trendeo
vecteurplus
verticalsearch
vsw
winello
fetch
infoseek
msiecrawler
offline explorer
sitecheck.internetseer.com
teleport
teleportpro
webcopier
webstripper
zealbot
asknread.com
omgilibot
omgili
xenu link sleuth/1.3.8
chatgpt-user
ccbot
gptbot
google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.programme-television.org/sitemap/news.xml

Comments

  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449
  • Sitemaps
  • Robots exclus

Warnings

  • 2 invalid lines.