senecalakeguardian.org
robots.txt

Robots Exclusion Standard data for senecalakeguardian.org

Resource Scan

Scan Details

Site Domain	senecalakeguardian.org
Base Domain	senecalakeguardian.org
Scan Status	Ok
Last Scan	2024-10-19T09:50:40+00:00
Next Scan	2024-11-18T09:50:40+00:00

Last Scan

Scanned	2024-10-19T09:50:40+00:00
URL	https://senecalakeguardian.org/robots.txt
Domain IPs	35.91.48.185
Response IP	35.91.48.185
Found	Yes
Hash	43259678e73d40cd1b52ec926e357a8580c916add72e6dd14b00589083a80238
SimHash	710ca002d6b0

Groups

googlebot
googlebot-imageuser-agent: mediapartners-google
bingbot
msnbot
msnbot-media
yahoo-blogs
yahoo-mmcrawler

Rule	Path
Allow	/

ahrefsbot
amazonbot
architextspider
baiduspider
blexbot
bytespider
claudebot
discoveryengine.com
domainreanimator.com
dtsagent
ezooms
fbot
findestars
gsa-crawler
megaindex.ru
mj12bot
myonid
openlinkprofiler.org
opensiteexplorer.org
peekyou
pipl
rapleaf
riddler.io
snitch
spock
terrykyleseoagency.com
tweepz
wink
scooter
slurp
sosospider
teoma
wbsearchbot
wpspider
xovibot.net
yandex
yasni
yeti
yoname
yourtraces
zoominfo
ia_archiver
semrushbot
semrushbot-sa

Rule	Path
Disallow	/

*

Rule	Path
Disallow	/catalog/
Disallow	/catalogFiles/
Disallow	/eblasts/
Disallow	/flash/
Disallow	/images/
Disallow	/requires/
Disallow	/rte/
Disallow	/survey/
Disallow	/templates/
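
The three groups above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser. It assumes a hand-written reconstruction of a subset of the listed groups, since the original file text is not reproduced in this report. The agent name "examplecrawler" and the paths "/images/logo.png" and "/about" are hypothetical, chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction: a subset of the groups listed above,
# written back out in robots.txt syntax. The exact layout of the real
# file is an assumption; only the agents and rules come from the report.
ROBOTS_TXT = """\
User-agent: googlebot
User-agent: bingbot
Allow: /

User-agent: ahrefsbot
User-agent: claudebot
Disallow: /

User-agent: *
Disallow: /catalog/
Disallow: /images/
Disallow: /survey/
"""

parser = RobotFileParser()
# parse() accepts an iterable of lines, so no network access is needed.
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://senecalakeguardian.org"


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


# Allow-listed crawler: everything is permitted, including /images/.
print(allowed("googlebot", "/images/logo.png"))       # True
# Block-listed crawler: nothing is permitted.
print(allowed("claudebot", "/"))                      # False
# Any other crawler falls through to the "*" group.
print(allowed("examplecrawler", "/images/logo.png"))  # False
print(allowed("examplecrawler", "/about"))            # True
```

Note that urllib.robotparser matches agent names by substring and applies the first matching rule in a group, so its answers may differ from crawlers that use longest-match semantics.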

Other Records

Field	Value
sitemap	https://www.senecalakeguardian.org/sitemap.xml

Comments

allow only important bots
blocking people search engines and bad spiders

Warnings

2 invalid lines.
