media-match.com
robots.txt

Robots Exclusion Standard data for media-match.com

Resource Scan

Scan Details

Site Domain media-match.com
Base Domain media-match.com
Scan Status Ok
Last Scan 2024-05-26T03:35:29+00:00
Next Scan 2024-06-25T03:35:29+00:00
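
The gap between the two timestamps above can be checked with a short sketch. The variable names are illustrative only, and the page does not state that a fixed rescan interval is used; the code simply subtracts the two reported values.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details fields above.
last_scan = datetime.fromisoformat("2024-05-26T03:35:29+00:00")
next_scan = datetime.fromisoformat("2024-06-25T03:35:29+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)
```

For these two values the difference is exactly 30 days.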

Last Scan

Scanned 2024-05-26T03:35:29+00:00
URL https://media-match.com/robots.txt
Domain IPs 107.23.105.167, 3.219.87.251
Response IP 3.219.87.251
Found Yes
Hash ce42ae0b4a6c32100c5cd5e108fe7be91a91dfe5fa2149155c697b4fb10d6eb3
SimHash 5c5093204b15
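
The Hash value above is 64 hexadecimal digits long, which is the length of a SHA-256 digest. The page does not say which algorithm is used or which bytes are hashed, so the following is only a sketch under the assumption that it is SHA-256 over the raw response body. The helper names are hypothetical.

```python
import hashlib

# Hash value copied from the Last Scan fields above.
REPORTED_HASH = "ce42ae0b4a6c32100c5cd5e108fe7be91a91dfe5fa2149155c697b4fb10d6eb3"

def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()

def matches_reported(body: bytes) -> bool:
    """Assumption: the reported hash is SHA-256 of the raw robots.txt bytes."""
    return sha256_hex(body) == REPORTED_HASH
```

A caller would pass the bytes fetched from the URL listed above. A mismatch may only mean that the file has changed since the scan or that the assumption about the algorithm is wrong.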

Groups

*

Rule Path
Disallow /

googlebot

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

chrome-lighthouse

Rule Path
Allow /

slurp

Rule Path
Allow /

msnbot

Rule Path
Allow /

bingbot

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

qwantify

Rule Path
Allow /

indeed

Rule Path
Allow /

jobrapido

Rule Path
Allow /

trovitbot

Rule Path
Allow /

jooblebot

Rule Path
Allow /

linkedinbot

Rule Path
Allow /

twitterbot

Rule Path
Allow /

yandex

Rule Path
Disallow /

yandexvideoparser

Rule Path
Disallow /

buzzbot

Rule Path
Disallow /

trendiction

Rule Path
Disallow /

hubspot

Rule Path
Disallow /

genieo

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

flipboardproxy

Rule Path
Disallow /

exabot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

linkdex

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

baidu

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

spiderbook

Rule Path
Disallow /
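
The groups above amount to a default-deny policy with named exceptions. A minimal sketch of how such rules can be evaluated uses Python's standard urllib.robotparser. The robots.txt text below is reconstructed from a subset of the listed groups; its directive casing, ordering, and blank-line layout are assumptions, not the file's actual bytes, and is_allowed is a hypothetical helper.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from four of the groups listed above (assumed layout).
ROBOTS_TXT = """\
User-agent: *
Disallow: /

User-agent: googlebot
Allow: /

User-agent: bingbot
Allow: /

User-agent: ahrefsbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

def is_allowed(agent: str, url: str = "https://media-match.com/") -> bool:
    """Return True if the parsed rules permit agent to fetch url."""
    return parser.can_fetch(agent, url)

for agent in ("googlebot", "bingbot", "ahrefsbot", "examplebot"):
    print(agent, is_allowed(agent))
```

An agent with no group of its own, such as the made-up examplebot, falls back to the * group and is denied. The standard-library parser matches agent names by case-insensitive substring, which may differ from how individual crawlers interpret the file.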