mattboldt.com
robots.txt

Robots Exclusion Standard data for mattboldt.com

Resource Scan

Scan Details

Site Domain mattboldt.com
Base Domain mattboldt.com
Scan Status Ok
Last Scan2024-09-20T17:18:52+00:00
Next Scan 2024-10-20T17:18:52+00:00

Last Scan

Scanned2024-09-20T17:18:52+00:00
URL https://mattboldt.com/robots.txt
Domain IPs 13.215.144.61, 2406:da18:880:3801::c8, 2406:da18:b3d:e200::64, 52.74.166.77
Response IP 13.228.199.255
Found Yes
Hash aa1423ed3f28bc30dea4d2186bf2f624c1e7050d5eb89aa641d6407c335c777f
SimHash 3c551bd2e2db

Groups

adsbot-google
adsbot-google-mobile-apps
adidxbot
applebot
applenewsbot
baiduspider
baiduspider-image
bingbot
bingpreview
ccbot
cliqzbot
coccoc
coccocbot-image
coccocbot-web
daumoa
dazoobot
deusu
duckduckbot
duckduckgo-favicons-bot
euripbot
exploratodo
facebot
feedly
findxbot
googlebot
googlebot-image
googlebot-mobile
googlebot-news
googlebot-video
haosouspider
ichiro
istellabot
jikespider
lycos
mail.ru
mediapartners-google
mojeekbot
msnbot
msnbot-media
orangebot
pinterest
plukkie
qwantify
rambler
seznambot
sosospider
slurp
sogou blog
sogou inst spider
sogou news spider
sogou orion spider
sogou spider2
sogou web spider
sputnikbot
teoma
twitterbot
wotbox
yacybot
yandex
yandexmobilebot
yeti
yioopbot
yoozbot
youdaobot

Rule Path
Disallow

*

Rule Path
Disallow /

Other Records

Field Value
sitemap https://mattboldt.com/sitemap.xml

Comments

  • ROBOTS.TXT
  • Updates can be retrieved from: https://github.com/jonasjacek/robots.txt
  • This document is licensed with a CC BY-NC-SA 4.0 license.
  • Last update: 2017-09-13

Warnings

  • 3 invalid lines.