media-match.com
robots.txt

Robots Exclusion Standard data for media-match.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	media-match.com
Base Domain	media-match.com
Scan Status	Ok
Last Scan	2024-05-26T03:35:29+00:00
Next Scan	2024-06-25T03:35:29+00:00

Last Scan

Scanned	2024-05-26T03:35:29+00:00
URL	https://media-match.com/robots.txt
Domain IPs	107.23.105.167, 3.219.87.251
Response IP	3.219.87.251
Found	Yes
Hash	ce42ae0b4a6c32100c5cd5e108fe7be91a91dfe5fa2149155c697b4fb10d6eb3
SimHash	5c5093204b15

Groups

*

Rule	Path
Disallow	/

Rule

Path

Disallow

googlebot

Rule	Path
Allow	/

Rule

Path

Allow

mediapartners-google

Rule	Path
Allow	/

Rule

Path

Allow

googlebot-mobile

Rule	Path
Allow	/

Rule

Path

Allow

adsbot-google

Rule	Path
Allow	/

Rule

Path

Allow

chrome-lighthouse

Rule	Path
Allow	/

Rule

Path

Allow

slurp

Rule	Path
Allow	/

Rule

Path

Allow

msnbot

Rule	Path
Allow	/

Rule

Path

Allow

bingbot

Rule	Path
Allow	/

Rule

Path

Allow

duckduckbot

Rule	Path
Allow	/

Rule

Path

Allow

qwantify

Rule	Path
Allow	/

Rule

Path

Allow

indeed

Rule	Path
Allow	/

Rule

Path

Allow

jobrapido

Rule	Path
Allow	/

Rule

Path

Allow

trovitbot

Rule	Path
Allow	/

Rule

Path

Allow

jooblebot

Rule	Path
Allow	/

Rule

Path

Allow

linkedinbot

Rule	Path
Allow	/

Rule

Path

Allow

twitterbot

Rule	Path
Allow	/

Rule

Path

Allow

yandex

Rule	Path
Disallow	/

Rule

Path

Disallow

yandexvideoparser

Rule	Path
Disallow	/

Rule

Path

Disallow

buzzbot

Rule	Path
Disallow	/

Rule

Path

Disallow

trendiction

Rule	Path
Disallow	/

Rule

Path

Disallow

hubspot

Rule	Path
Disallow	/

Rule

Path

Disallow

genieo

Rule	Path
Disallow	/

Rule

Path

Disallow

rogerbot

Rule	Path
Disallow	/

Rule

Path

Disallow

piplbot

Rule	Path
Disallow	/

Rule

Path

Disallow

flipboardproxy

Rule	Path
Disallow	/

Rule

Path

Disallow

exabot

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

linkdex

Rule	Path
Disallow	/

Rule

Path

Disallow

linkdexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

baidu

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider

Rule	Path
Disallow	/

Rule

Path

Disallow

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

spiderbook

Rule	Path
Disallow	/

Rule

Path

Disallow

media-match.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

googlebot

mediapartners-google

googlebot-mobile

adsbot-google

chrome-lighthouse

slurp

msnbot

bingbot

duckduckbot

qwantify

indeed

jobrapido

trovitbot

jooblebot

linkedinbot

twitterbot

yandex

yandexvideoparser

buzzbot

trendiction

hubspot

genieo

rogerbot

piplbot

flipboardproxy

exabot

mj12bot

linkdex

linkdexbot

baidu

baiduspider

ahrefsbot

spiderbook

media-match.com
robots.txt