globalmedia.mx
robots.txt

Robots Exclusion Standard data for globalmedia.mx

Archived Snapshots

Resource Scan

Scan Details

Site Domain	globalmedia.mx
Base Domain	globalmedia.mx
Scan Status	Ok
Last Scan	2024-10-29T02:35:10+00:00
Next Scan	2024-11-28T02:35:10+00:00

Last Scan

Scanned	2024-10-29T02:35:10+00:00
URL	https://globalmedia.mx/robots.txt
Domain IPs	34.201.80.84, 54.157.4.65, 54.196.16.164, 54.91.6.89
Response IP	54.196.16.164
Found	Yes
Hash	27c7cc19a2123166d304e56cf6ec5641ea5a66c31ec3c8f96d1160c376a1b1ad
SimHash	38941f173ca6

Groups

*

Rule	Path
Disallow	/angular/
Disallow	/assets/

Rule

Path

Disallow

/angular/

Disallow

/assets/

genio

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

scooperbot

Rule	Path
Disallow	/

Rule

Path

Disallow

rogerbot

Rule	Path
Disallow	/

Rule

Path

Disallow

flamingo_searchengine

Rule	Path
Disallow	/

Rule

Path

Disallow

facebot

Rule	Path
Disallow	/

Rule

Path

Disallow

luminatebot

Rule	Path
Disallow	/

Rule

Path

Disallow

vagabondo

Rule	Path
Disallow	/

Rule

Path

Disallow

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

seznambot

Rule	Path
Disallow	/

Rule

Path

Disallow

r6_commentreader

Rule	Path
Disallow	/

Rule

Path

Disallow

yeti

Rule	Path
Disallow	/

Rule

Path

Disallow

heritrix

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider

Rule	Path
Disallow	/

Rule

Path

Disallow

showyoubot

Rule	Path
Disallow	/

Rule

Path

Disallow

gozaikbot

Rule	Path
Disallow	/

Rule

Path

Disallow

python-requests

Rule	Path
Disallow	/

Rule

Path

Disallow

queryseekerspider

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot

Rule	Path
Disallow	/

Rule

Path

Disallow

yandeximages

Rule	Path
Disallow	/

Rule

Path

Disallow

apache-httpclient

Rule	Path
Disallow	/

Rule

Path

Disallow

piplbot

Rule	Path
Disallow	/

Rule

Path

Disallow

scrapy

Rule	Path
Disallow	/

Rule

Path

Disallow

buck

Rule	Path
Disallow	/

Rule

Path

Disallow

wikido

Rule	Path
Disallow	/

Rule

Path

Disallow

zoominfobot

Rule	Path
Disallow	/

Rule

Path

Disallow

sogou

Rule	Path
Disallow	/

Rule

Path

Disallow

zend_http_client

Rule	Path
Disallow	/

Rule

Path

Disallow

robots

Rule	Path
Disallow	/

Rule

Path

Disallow

arquivo-web-crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

bidswitchbot

Rule	Path
Disallow	/

Rule

Path

Disallow

g-i-g-a-b-o-t

Rule	Path
Disallow	/

Rule

Path

Disallow

gigabot

Rule	Path
Disallow	/

Rule

Path

Disallow

garlikcrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

caam

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

clickagy intelligence bot

Rule	Path
Disallow	/

Rule

Path

Disallow

jersey

Rule	Path
Disallow	/

Rule

Path

Disallow

libwww-perl

Rule	Path
Disallow	/

Rule

Path

Disallow

ltx71

Rule	Path
Disallow	/

Rule

Path

Disallow

omgili

Rule	Path
Disallow	/

Rule

Path

Disallow

piplbot

Rule	Path
Disallow	/

Rule

Path

Disallow

python-urllib

Rule	Path
Disallow	/

Rule

Path

Disallow

zoominfobot

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://www.globalmedia.mx/sitemap.xml
sitemap	https://www.globalmedia.mx/google_news.xml

Field

Value

sitemap

https://www.globalmedia.mx/sitemap.xml

sitemap

https://www.globalmedia.mx/google_news.xml

Comments

See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
To ban all spiders from the entire site uncomment the next two lines:
User-agent: *

globalmedia.mxrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

genio

mj12bot

scooperbot

rogerbot

flamingo_searchengine

facebot

luminatebot

vagabondo

ahrefsbot

seznambot

r6_commentreader

yeti

heritrix

baiduspider

showyoubot

gozaikbot

python-requests

queryseekerspider

dotbot

yandeximages

apache-httpclient

piplbot

scrapy

buck

wikido

zoominfobot

sogou

zend_http_client

robots

arquivo-web-crawler

bidswitchbot

g-i-g-a-b-o-t

gigabot

garlikcrawler

caam

ccbot

clickagy intelligence bot

jersey

libwww-perl

ltx71

omgili

piplbot

python-urllib

zoominfobot

Other Records

Comments

globalmedia.mx
robots.txt