techtargetmedia.com
robots.txt

Robots Exclusion Standard data for techtargetmedia.com

Resource Scan

Scan Details

Site Domain techtargetmedia.com
Base Domain techtargetmedia.com
Scan Status Ok
Last Scan2025-11-06T14:34:27+00:00
Next Scan 2025-12-06T14:34:27+00:00

Last Scan

Scanned2025-11-06T14:34:27+00:00
URL https://techtargetmedia.com/robots.txt
Domain IPs 198.54.115.91
Response IP 198.54.115.91
Found Yes
Hash 981dbd4d79b90b4e13e4752e83b3d48d2b61cffa581c5512539d356e0a0a679e
SimHash 42248160f7f0

Groups

*

Product Comment
* applies to all robots
Rule Path
Disallow /*/feed/$
Disallow /?nonamp=1%2F
Disallow /?amp=1
Disallow /?noamp=mobile
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

googlebot

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow
Allow /*

boomtrain-content-bot*

Rule Path
Disallow
Allow /*

googlebot-image

Rule Path
Disallow
Allow /*

adsbot-google

Rule Path
Disallow
Allow /*

googlebot-news

Rule Path
Disallow
Allow /*

bingbot

Rule Path
Disallow

msnbot

Rule Path
Disallow

slurp

Rule Path
Disallow

duckduckbot

Rule Path
Disallow

baiduspider

Rule Path
Disallow

yandexbot

Rule Path
Disallow

ia_archiver

Rule Path
Disallow

teoma

Rule Path
Disallow

rogerbot

Rule Path
Disallow

rogerbot/1.2

Rule Path
Disallow

dotbot

Rule Path
Disallow

dotbot/1.1

Rule Path
Disallow

ahrefsbot

Rule Path
Disallow
Allow /*

mj12bot

Rule Path
Disallow
Allow /*

semrushbot

Rule Path
Disallow
Allow /*

ninjabot

Rule Path
Disallow

facebot

Rule Path
Disallow

twitterbot

Rule Path
Disallow

linkedinbot

Rule Path
Disallow

Comments

  • Adding Multiple Sitemaps
  • Allowed Good User Agents for better Crawl

Warnings

  • 4 invalid lines.