gladiacteur.com
robots.txt

Robots Exclusion Standard data for gladiacteur.com

Resource Scan

Scan Details

Site Domain gladiacteur.com
Base Domain gladiacteur.com
Scan Status Ok
Last Scan 2024-09-28T16:01:04+00:00
Next Scan 2024-10-05T16:01:04+00:00

Last Scan

Scanned 2024-09-28T16:01:04+00:00
URL https://gladiacteur.com/robots.txt
Domain IPs 173.212.208.33
Response IP 173.212.208.33
Found Yes
Hash 479d223f3b9430aeb680265f3bb197cda272bdaa2b1a322a4215dd8d5588f6b6
SimHash e04a7c904f65

Groups

*

Rule Path
Allow /*/autoptimize/autoptimize_*.php*
Disallow /*blackhole
Disallow /?blackhole
Disallow /wp-login.php
Disallow */trackback
Disallow /*/comments
Disallow /cgi-bin
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz
Disallow /*.cgi
Allow /*css?*
Allow /*js?*
Allow /*?utm*
Allow /css/?
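
The wildcard rules in this group can be hard to evaluate by eye. The sketch below is one possible way to test a path against them, assuming the matching conventions of RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The function names `pattern_to_regex` and `is_allowed` are illustrative and not part of any library, and real crawlers may differ in details such as percent-encoding or how they treat a pattern that does not start with `/`.

```python
import re

# Rules copied from the "*" group listed above.
RULES = [
    ("allow", "/*/autoptimize/autoptimize_*.php*"),
    ("disallow", "/*blackhole"),
    ("disallow", "/?blackhole"),
    ("disallow", "/wp-login.php"),
    ("disallow", "*/trackback"),
    ("disallow", "/*/comments"),
    ("disallow", "/cgi-bin"),
    ("disallow", "/*.php$"),
    ("disallow", "/*.inc$"),
    ("disallow", "/*.gz"),
    ("disallow", "/*.cgi"),
    ("allow", "/*css?*"),
    ("allow", "/*js?*"),
    ("allow", "/*?utm*"),
    ("allow", "/css/?"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*', a trailing '$' anchors the end of the path,
    and every other character (including '?') is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: the longest matching pattern wins and Allow wins a tie;
    a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

Under these assumptions, `is_allowed("/wp-login.php")` is false, while a path such as `/wp-content/cache/autoptimize/autoptimize_abc.php` (a made-up example) is allowed because the longer Allow pattern outranks `/*.php$`.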

googlebot-image

Rule Path
Allow /*

mediapartners-google*

Rule Path
Allow /*

meltawer

Rule Path
Disallow /

digimind

Rule Path
Disallow /

knowings

Rule Path
Disallow /

sindup

Rule Path
Disallow /

cision

Rule Path
Disallow /

talkwater

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

converacrawler

Rule Path
Disallow /

quepasacreep

Rule Path
Disallow /

jetbot

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

kbcrawl

Rule Path
Disallow /

amisoftware

Rule Path
Disallow /

newzbin

Rule Path
Disallow /

ask n read

Rule Path
Disallow /

qwam content intelligence

Rule Path
Disallow /

zite

Rule Path
Disallow /

flipboard

Rule Path
Disallow /

youmag

Rule Path
Disallow /

synthesio

Rule Path
Disallow /

trendybuzz

Rule Path
Disallow /

spotter

Rule Path
Disallow /

scoop.it

Rule Path
Disallow /

linkfluence

Rule Path
Disallow /

augure

Rule Path
Disallow /

corporama

Rule Path
Disallow /

readability.com

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

libwww

Rule Path
Disallow /

wget

Rule Path
Disallow /

adequat

Rule Path
Disallow /

adequat-systems

Rule Path
Disallow /

auramundi

Rule Path
Disallow /

coexel

Rule Path
Disallow /

ellisphere

Rule Path
Disallow /

leadbox

Rule Path
Disallow /

mention

Rule Path
Disallow /

moreover

Rule Path
Disallow /

mytwip

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

newzbin

Rule Path
Disallow /

opinion-tracker

Rule Path
Disallow /

proxem

Rule Path
Disallow /

score3

Rule Path
Disallow /

trendeo

Rule Path
Disallow /

vecteurplus

Rule Path
Disallow /

verticalsearch

Rule Path
Disallow /

vsw

Rule Path
Disallow /

winello

Rule Path
Disallow /

fetch

Rule Path
Disallow /

infoseek

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

asknread.com

Rule Path
Disallow /

ellisphere

Rule Path
Disallow /

spotter

Rule Path
Disallow /

*

Rule Path
Disallow /wp-content/uploads/wp-import-export-lite/

*

Rule Path
Disallow /

bingbot
googlebot
slurp

Rule Path
Allow /

*

Rule Path
Disallow /

bingbot
googlebot
slurp

Rule Path
Allow /

*

Rule Path
Disallow /

bingbot
googlebot
slurp

Rule Path
Allow /

*

Rule Path
Disallow /

bingbot
googlebot
slurp

Rule Path
Allow /

*

Rule Path
Disallow /

bingbot
googlebot
slurp

Rule Path
Allow /
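
Several user agents, including `*`, appear in more than one group above. As a rough illustration, the sketch below merges repeated groups per user agent, assuming the RFC 9309 convention that a crawler combines the rules of every group naming it and falls back to `*` only when no group names it. The `GROUPS` list is a hand-copied subset of the tail groups in this report, and `merge_groups` and `rules_for` are hypothetical helper names; the exact-token lookup is a simplification of real user-agent matching.

```python
from collections import defaultdict

# Hand-copied subset of the repeated tail groups listed above.
GROUPS = [
    (["*"], [("disallow", "/wp-content/uploads/wp-import-export-lite/")]),
    (["*"], [("disallow", "/")]),
    (["bingbot", "googlebot", "slurp"], [("allow", "/")]),
    (["*"], [("disallow", "/")]),
    (["bingbot", "googlebot", "slurp"], [("allow", "/")]),
]


def merge_groups(groups):
    """Combine the rules of all groups that name the same user agent.

    Duplicate rules are kept only once, preserving first-seen order.
    """
    merged = defaultdict(list)
    for agents, rules in groups:
        for agent in agents:
            for rule in rules:
                if rule not in merged[agent]:
                    merged[agent].append(rule)
    return dict(merged)


def rules_for(user_agent, merged):
    """Return the rules for a crawler token, falling back to '*'.

    Simplification: the token must equal a group name exactly
    (case-insensitively); no substring or version matching is done.
    """
    token = user_agent.lower()
    return merged.get(token, merged.get("*", []))
```

With this subset, `rules_for("Googlebot", merge_groups(GROUPS))` yields only `("allow", "/")`, while an unlisted crawler receives the merged `*` rules, including `Disallow /`.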

Other Records

Field Value
sitemap https://gladiacteur.com/sitemap_index.xml
sitemap http://gladiacteur.com/sitemap.xml
sitemap http://www.gladiacteur.com/sitemap.xml
sitemap http://173.212.208.33:8090/sitemap.xml
sitemap http://vmi156097.contaboserver.net:8090/sitemap.xml
sitemap http://www.gladiacteur.com:8090/sitemap.xml

Comments

  • URLs I do not want indexed: login, trackbacks, comments
  • URLs allowed for bots: CSS, JS, analytics
  • Allow Google Images
  • Allow Google AdSense
  • WP Import Export Rule

Warnings

  • 4 invalid lines.