blogagc.com
robots.txt

Robots Exclusion Standard data for blogagc.com

Resource Scan

Scan Details

Site Domain blogagc.com
Base Domain blogagc.com
Scan Status Ok
Last Scan2024-09-19T00:45:37+00:00
Next Scan 2024-09-26T00:45:37+00:00

Last Scan

Scanned2024-09-19T00:45:37+00:00
URL https://blogagc.com/robots.txt
Redirect https://www.blogagc.com/robots.txt
Redirect Domain www.blogagc.com
Redirect Base blogagc.com
Domain IPs 2001:4860:4802:32::15, 2001:4860:4802:34::15, 2001:4860:4802:36::15, 2001:4860:4802:38::15, 216.239.32.21, 216.239.34.21, 216.239.36.21, 216.239.38.21
Redirect IPs 172.253.118.121, 2404:6800:4003:c1a::79
Response IP 142.251.10.121
Found Yes
Hash 56748bbc745da7f7b20e6e9d86dab1e83ccb6d3caf5fdd896ac163adf9b37f2d
SimHash 2990dbb7c5fd

Groups

mediapartners-google
googlebot
googlebot-news
googlebot-image
googlebot-mobile
feedfetcher-google
mediapartners-google
adsbot-google
adsbot-google-mobile
bingbot
msnbot
msnbot-media
adidxbot
bingpreview
yandex
yandexwebmaster
yandexbot
yandexblogs
yandexnews
yandexmobilebot
yandeximages
yandexmedia
yandexsitelinks
yandexpagechecker
yandexpartner
yandexspravbot
yandexmobilescreenshotbot
yandeximageresizer
slurp
yahoo pipes 1.0
yahoo! slurp
archive.org_bot
duckduckbot
ia_archiver
applewebkit
facebot
facebookexternalhit
linkedinbot
twitterbot
baiduspider
baiduspider-image
baiduspider-news
sogou web spider
sogou pic spider
sogou head spider
sogou orion spider
sogou-test-spider
speedy spider
*

Rule Path
Disallow /search/
Disallow /wp-content/
Disallow /megamenu/
Disallow /p/
Allow /

Other Records

Field Value
sitemap https://www.blogagc.com/sitemap.xml
sitemap https://blogagc.com/sitemap.xml