chicbrides.com.mx
robots.txt

Robots Exclusion Standard data for chicbrides.com.mx

Archived Snapshots

Resource Scan

Scan Details

Site Domain	chicbrides.com.mx
Base Domain	chicbrides.com.mx
Scan Status	Ok
Last Scan	2024-11-16T12:15:07+00:00
Next Scan	2024-11-23T12:15:07+00:00

Last Scan

Scanned	2024-11-16T12:15:07+00:00
URL	https://chicbrides.com.mx/robots.txt
Redirect	https://www.chicmagazine.com.mx/robots.txt
Redirect Domain	www.chicmagazine.com.mx
Redirect Base	chicmagazine.com.mx
Domain IPs	18.155.68.16, 18.155.68.33, 18.155.68.59, 18.155.68.8
Redirect IPs	216.137.52.109, 216.137.52.11, 216.137.52.67, 216.137.52.95
Response IP	18.165.140.53
Found	Yes
Hash	81b222b8ba530d9c56074c8bbd75fc009d5dbee2a0bbe3701f19dd689514ebab
SimHash	b8161f032d65

Groups

*

Rule	Path
Disallow	/node/
Disallow	/cdb/
Disallow	/wp-content/
Disallow	/sites/
Disallow	/Topicos/
Disallow	/7198/
Disallow	/noticias/
Disallow	/bbtstats/
Disallow	/bbtfile/
Disallow	/feed/
Disallow	/rss7/
Disallow	/rss10/
Disallow	/MediaCenter/
Disallow	/portal/

Rule

Path

Disallow

/node/

Disallow

/cdb/

Disallow

/wp-content/

Disallow

/sites/

Disallow

/Topicos/

Disallow

/7198/

Disallow

/noticias/

Disallow

/bbtstats/

Disallow

/bbtfile/

Disallow

/feed/

Disallow

/rss7/

Disallow

/rss10/

Disallow

/MediaCenter/

Disallow

/portal/

genio

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

scooperbot

Rule	Path
Disallow	/

Rule

Path

Disallow

rogerbot

Rule	Path
Disallow	/

Rule

Path

Disallow

flamingo_searchengine

Rule	Path
Disallow	/

Rule

Path

Disallow

facebot

Rule	Path
Disallow	/

Rule

Path

Disallow

luminatebot

Rule	Path
Disallow	/

Rule

Path

Disallow

vagabondo

Rule	Path
Disallow	/

Rule

Path

Disallow

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

seznambot

Rule	Path
Disallow	/

Rule

Path

Disallow

r6_commentreader

Rule	Path
Disallow	/

Rule

Path

Disallow

yeti

Rule	Path
Disallow	/

Rule

Path

Disallow

heritrix

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider

Rule	Path
Disallow	/

Rule

Path

Disallow

showyoubot

Rule	Path
Disallow	/

Rule

Path

Disallow

gozaikbot

Rule	Path
Disallow	/

Rule

Path

Disallow

python-requests

Rule	Path
Disallow	/

Rule

Path

Disallow

queryseekerspider

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot

Rule	Path
Disallow	/

Rule

Path

Disallow

yandeximages

Rule	Path
Disallow	/

Rule

Path

Disallow

apache-httpclient

Rule	Path
Disallow	/

Rule

Path

Disallow

piplbot

Rule	Path
Disallow	/

Rule

Path

Disallow

scrapy

Rule	Path
Disallow	/

Rule

Path

Disallow

buck

Rule	Path
Disallow	/

Rule

Path

Disallow

wikido

Rule	Path
Disallow	/

Rule

Path

Disallow

zoominfobot

Rule	Path
Disallow	/

Rule

Path

Disallow

sogou

Rule	Path
Disallow	/

Rule

Path

Disallow

zend_http_client

Rule	Path
Disallow	/

Rule

Path

Disallow

robots

Rule	Path
Disallow	/

Rule

Path

Disallow

arquivo-web-crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

bidswitchbot

Rule	Path
Disallow	/

Rule

Path

Disallow

g-i-g-a-b-o-t

Rule	Path
Disallow	/

Rule

Path

Disallow

gigabot

Rule	Path
Disallow	/

Rule

Path

Disallow

garlikcrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

caam

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

clickagy intelligence bot

Rule	Path
Disallow	/

Rule

Path

Disallow

jersey

Rule	Path
Disallow	/

Rule

Path

Disallow

libwww-perl

Rule	Path
Disallow	/

Rule

Path

Disallow

ltx71

Rule	Path
Disallow	/

Rule

Path

Disallow

omgili

Rule	Path
Disallow	/

Rule

Path

Disallow

piplbot

Rule	Path
Disallow	/

Rule

Path

Disallow

python-urllib

Rule	Path
Disallow	/

Rule

Path

Disallow

zoominfobot

Rule	Path
Disallow	/

Rule

Path

Disallow

grapeshot

Rule	Path
Disallow

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://www.chicmagazine.com.mx/sitemap/sitemap-articles-index.xml
sitemap	https://www.chicmagazine.com.mx/sitemap/sitemap-google-news-index.xml
sitemap	https://www.chicmagazine.com.mx/sitemap/sitemap-tags-index.xml
sitemap	https://www.chicmagazine.com.mx/sitemap/sitemap-images-index.xml
sitemap	https://www.chicmagazine.com.mx/sitemap/sitemap-videos-index.xml

Field

Value

sitemap

https://www.chicmagazine.com.mx/sitemap/sitemap-articles-index.xml

sitemap

https://www.chicmagazine.com.mx/sitemap/sitemap-google-news-index.xml

sitemap

https://www.chicmagazine.com.mx/sitemap/sitemap-tags-index.xml

sitemap

https://www.chicmagazine.com.mx/sitemap/sitemap-images-index.xml

sitemap

https://www.chicmagazine.com.mx/sitemap/sitemap-videos-index.xml

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/wc/robots.html
For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html

chicbrides.com.mxrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

genio

mj12bot

scooperbot

rogerbot

flamingo_searchengine

facebot

luminatebot

vagabondo

ahrefsbot

seznambot

r6_commentreader

yeti

heritrix

baiduspider

showyoubot

gozaikbot

python-requests

queryseekerspider

dotbot

yandeximages

apache-httpclient

piplbot

scrapy

buck

wikido

zoominfobot

sogou

zend_http_client

robots

arquivo-web-crawler

bidswitchbot

g-i-g-a-b-o-t

gigabot

garlikcrawler

caam

ccbot

clickagy intelligence bot

jersey

libwww-perl

ltx71

omgili

piplbot

python-urllib

zoominfobot

grapeshot

Other Records

Comments

chicbrides.com.mx
robots.txt