programme-television.org
robots.txt
Robots Exclusion Standard data for programme-television.org
Resource Scan
Scan Details
Site Domain | programme-television.org |
Base Domain | programme-television.org |
Scan Status | Ok |
Last Scan | 2024-06-21T22:40:27+00:00 |
Next Scan | 2024-06-28T22:40:27+00:00 |
Last Scan
Scanned | 2024-06-21T22:40:27+00:00 |
URL | https://programme-television.org/robots.txt |
Domain IPs | 3.163.158.122, 3.163.158.18, 3.163.158.23, 3.163.158.28 |
Response IP | 18.165.171.117 |
Found | Yes |
Hash | 2ea6272669858e14485cf4e171ab169cfec3d7b1104608e0a21d02491385a2e9 |
SimHash | 2846d0400c17 |
Groups
*
Rule | Path |
---|---|
Disallow | /*escaped_fragment |
Disallow | /api |
Disallow | /recherche |
Disallow | /replay/moteur |
Disallow | /replay/redirection |
Disallow | /replay/replaylist |
Disallow | /Teasers/* |
Disallow | /newsletter/ |
meltawer
digimind
knowings
sindup
cision
talkwater
turnitinbot
converacrawler
jetbot
newsnow
kbcrawl
amisoftware
newzbin
ask n read
qwam content intelligence
zite
youmag
synthesio
trendybuzz
spotter
scoop.it
linkfluence
augure
corporama
grub-client
k2spider
libwww
wget
adequat
adequat-systems
auramundi
coexel
ellisphere
leadbox
mention
moreover
mytwip
opinion-tracker
proxem
score3
trendeo
vecteurplus
verticalsearch
vsw
winello
fetch
infoseek
msiecrawler
offline explorer
sitecheck.internetseer.com
teleport
teleportpro
webcopier
webstripper
zealbot
asknread.com
omgilibot
omgili
xenu link sleuth/1.3.8
chatgpt-user
ccbot
gptbot
google-extended
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.programme-television.org/sitemap/news.xml |
Warnings
- 2 invalid lines.
Comments