glcomms.com
robots.txt

Robots Exclusion Standard data for glcomms.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	glcomms.com
Base Domain	glcomms.com
Scan Status	Ok
Last Scan	2024-08-31T23:03:26+00:00
Next Scan	2024-09-30T23:03:26+00:00

Last Scan

Scanned	2024-08-31T23:03:26+00:00
URL	https://www.glcomms.com/robots.txt
Domain IPs	52.6.58.221, 54.156.129.90, 54.174.45.59
Response IP	52.6.58.221
Found	Yes
Hash	3be5db1c0e19d25ceb5d8a373f5d2b2adc3bac902ab1b7b272d27641252d292d
SimHash	29b0c3d0d475

Groups

adsbot-google
alexabot
bingpreview
cloudflareprefetch
friendfeedbot
funnelback
google
google favicon
google-site-verification
google-sitemaps
googlebot
googlebot-image
googlebot-mobile
googlebot-news
googlebot-video
mediapartners-google
pingdom
pinterest
scoutjet
slurp
spinn3r
teoma
twitterbot
yandex
yandeximages
yandexvideoparser
yeti
archive.org_bot
baiduspider
bingbot
facebookexternalhit
gsa-crawler
houzzbot
ia_archiver
msnbot
rogerbot

Rule	Path
Disallow	/404
Disallow	/access-denied
Disallow	/admin
Disallow	/api
Disallow	/cart
Disallow	/checkout
Disallow	/client
Disallow	/date
Disallow	/downloads
Disallow	/go
Disallow	/hack
Disallow	/keyword
Disallow	/order
Disallow	/password
Disallow	/popular
Disallow	/search
Disallow	/services/api/php
Disallow	/services/api/rest
Disallow	/services/api/xmlrpc
Disallow	/test
Allow	/api/developer
Allow	/api/doc
Allow	/api/v2

Rule

Path

Disallow

/404

Disallow

/access-denied

Disallow

/admin

Disallow

/api

Disallow

/cart

Disallow

/checkout

Disallow

/client

Disallow

/date

Disallow

/downloads

Disallow

/go

Disallow

/hack

Disallow

/keyword

Disallow

/order

Disallow

/password

Disallow

/popular

Disallow

/search

Disallow

/services/api/php

Disallow

/services/api/rest

Disallow

/services/api/xmlrpc

Disallow

/test

Allow

/api/developer

Allow

/api/doc

Allow

/api/v2

google
twitterbot
facebookexternalhit
houzzbot

Rule	Path
Allow	/date
Allow	/keyword
Allow	/popular

Rule

Path

Allow

/date

Allow

/keyword

Allow

/popular

*

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	https://www.glcomms.com/sitemap-index.xml

Field

Value

sitemap

https://www.glcomms.com/sitemap-index.xml

Back to top

Comments

See https://secure.smugmug.com/help/contact if you'd like to apply to be allowlisted for crawling SmugMug.

Back to top

glcomms.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

googletwitterbotfacebookexternalhithouzzbot

*

Other Records

Comments

glcomms.com
robots.txt

google
twitterbot
facebookexternalhit
houzzbot