cetoketo.com
robots.txt

Robots Exclusion Standard data for cetoketo.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	cetoketo.com
Base Domain	cetoketo.com
Scan Status	Ok
Last Scan	2024-09-27T08:23:10+00:00
Next Scan	2024-10-04T08:23:10+00:00

Last Scan

Scanned	2024-09-27T08:23:10+00:00
URL	https://cetoketo.com/robots.txt
Domain IPs	170.39.213.85
Response IP	170.39.213.85
Found	Yes
Hash	31527cdb16598ef9d8a9fe8d50cdf00905c494ed1274d9b716b8addd5e463efc
SimHash	b33bb55f0a67

Groups

*

Rule	Path
Disallow	/administrator/
Disallow	/bin/
Disallow	/cache/
Disallow	/cli/
Disallow	/components/
Disallow	/includes/
Disallow	/installation/
Disallow	/language/
Disallow	/layouts/
Disallow	/libraries/
Disallow	/logs/
Disallow	/modules/
Disallow	/plugins/
Disallow	/tmp/
Disallow	/vota/
Disallow	/votacion/
Disallow	/votacionz/
Disallow	/pdfa/
Disallow	/code/
Disallow	/wp-admin/

Rule

Path

Disallow

/administrator/

Disallow

/bin/

Disallow

/cache/

Disallow

/cli/

Disallow

/components/

Disallow

/includes/

Disallow

/installation/

Disallow

/language/

Disallow

/layouts/

Disallow

/libraries/

Disallow

/logs/

Disallow

/modules/

Disallow

/plugins/

Disallow

/tmp/

Disallow

/vota/

Disallow

/votacion/

Disallow

/votacionz/

Disallow

/pdfa/

Disallow

/code/

Disallow

/wp-admin/

rogerbot
exabot
mj12bot
dotbot
gigabot
ahrefsbot
blackwidow
chinaclaw
custo
disco
download\ demon
ecatch
eirgrabber
emailsiphon
emailwolf
express\ webpictures
extractorpro
eyenetie
flashget
getright
getweb!
go!zilla
go-ahead-got-it
grabnet
grafula
hmview
httrack
image\ stripper
image\ sucker
indy\ library
interget
internet\ ninja
jetcar
joc\ web\ spider
larbin
leechftp
mass\ downloader
midown\ tool
mister\ pix
navroad
nearsite
netants
netspider
net\ vampire
netzip
octopus
offline\ explorer
offline\ navigator
pagegrabber
papa\ foto
pavuk
pcbrowser
realdownload
reget
semrushbot
semrushbot-bm
semrushbot-sa
semrushbot-ba
semrushbot-si
semrushbot-swa
semrushbot-ct
semrushbot-seoab
ahrefsbot
mj12bot
yandexbot
baiduspider
spbot
petalbot
dotbot
mauibot
pinterest
yandeximageresizer
coccocbot
coccocbot-web
coccocbot-image
yeti
baiduspider
sogou web spider
sogou
seekbot
seekport
seekport crawler
sitesnagger
smartdownload
superbot
superhttp
surfbot
takeout
teleport\ pro
voideye
web\ image\ collector
web\ sucker
webauto
webcopier
webfetch
webgo\ is
webleacher
webreaper
websauger
website\ extractor
website\ quester
webstripper
webwhacker
webzip
wget
widow
wwwoffle
xaldon\ webspider
zeus

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Comments

If the Joomla site is installed within a folder such as at
e.g. www.example.com/joomla/ the robots.txt file MUST be
moved to the site root at e.g. www.example.com/robots.txt
AND the joomla folder name MUST be prefixed to the disallowed
path, e.g. the Disallow rule for the /administrator/ folder
MUST be changed to read Disallow: /joomla/administrator/
For more information about the robots.txt standard, see:
http://www.robotstxt.org/orig.html
For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html

Back to top

Warnings

1 invalid line.

Back to top

cetoketo.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Comments

Warnings

cetoketo.com
robots.txt