blackpressusa.com
robots.txt

Robots Exclusion Standard data for blackpressusa.com

Resource Scan

Scan Details

Site Domain blackpressusa.com
Base Domain blackpressusa.com
Scan Status Ok
Last Scan2024-11-02T17:42:16+00:00
Next Scan 2024-11-09T17:42:16+00:00

Last Scan

Scanned2024-11-02T17:42:16+00:00
URL https://blackpressusa.com/robots.txt
Domain IPs 192.124.249.4
Response IP 192.124.249.4
Found Yes
Hash c9c1da2f738d2f4b375b6e54f6100b77387abbf6fd5415762eaa19d66f348b67
SimHash 1134db00a020

Groups

semrushbot
siteauditbot
semrushbot-ba
semrushbot-si
semrushbot-swa
semrushbot-ct
splitsignalbot
semrushbot-coub

Rule Path
Disallow /

baiduspider
yisouspider
petalbot
amazonbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

Comments

  • Block Semrush bots
  • Block problem bots
  • Block OpenAI
  • Block Google Bard AI
  • Block Common Crawl AI scraper
  • Block Perplexity AI
  • Block other misc AI scrapers

Warnings

  • 1 invalid line.