apawire.smugmug.com
robots.txt

Robots Exclusion Standard data for apawire.smugmug.com

Resource Scan

Scan Details

Site Domain apawire.smugmug.com
Base Domain smugmug.com
Scan Status Ok
Last Scan2024-10-26T00:17:50+00:00
Next Scan 2024-11-25T00:17:50+00:00

Last Scan

Scanned2024-10-26T00:17:50+00:00
URL https://apawire.smugmug.com/robots.txt
Redirect https://www.apawire.com/robots.txt
Redirect Domain www.apawire.com
Redirect Base apawire.com
Domain IPs 13.35.233.159
Redirect IPs 3.223.165.176, 3.224.159.82, 52.44.33.224
Response IP 52.44.33.224
Found Yes
Hash 6d09af9cf2adbfb63ea2946b41e76b103449c73844863d86fd82b768b59d2bfb
SimHash 29b0c3d0d475

Groups

adsbot-google
alexabot
bingpreview
cloudflareprefetch
friendfeedbot
funnelback
google
google favicon
google-site-verification
google-sitemaps
googlebot
googlebot-image
googlebot-mobile
googlebot-news
googlebot-video
mediapartners-google
pingdom
pinterest
scoutjet
slurp
spinn3r
teoma
twitterbot
yandex
yandeximages
yandexvideoparser
yeti
archive.org_bot
baiduspider
bingbot
facebookexternalhit
gsa-crawler
houzzbot
ia_archiver
msnbot
rogerbot

Rule Path
Disallow /404
Disallow /access-denied
Disallow /admin
Disallow /api
Disallow /cart
Disallow /checkout
Disallow /client
Disallow /date
Disallow /downloads
Disallow /go
Disallow /hack
Disallow /keyword
Disallow /order
Disallow /password
Disallow /popular
Disallow /search
Disallow /services/api/php
Disallow /services/api/rest
Disallow /services/api/xmlrpc
Disallow /test
Allow /api/developer
Allow /api/doc
Allow /api/v2

google
twitterbot
facebookexternalhit
houzzbot

Rule Path
Allow /date
Allow /keyword
Allow /popular

*

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.apawire.com/sitemap-index.xml

Comments

  • See https://secure.smugmug.com/help/contact if you'd like to apply to be allowlisted for crawling SmugMug.