tinhnguyen.blog
robots.txt

Robots Exclusion Standard data for tinhnguyen.blog

Resource Scan

Scan Details

Site Domain tinhnguyen.blog
Base Domain tinhnguyen.blog
Scan Status Ok
Last Scan 2025-08-30T10:18:41+00:00
Next Scan 2025-09-06T10:18:41+00:00

Last Scan

Scanned 2025-08-30T10:18:41+00:00
URL https://tinhnguyen.blog/robots.txt
Domain IPs 104.21.112.1, 104.21.16.1, 104.21.32.1, 104.21.48.1, 104.21.64.1, 104.21.80.1, 104.21.96.1, 2606:4700:3030::6815:1001, 2606:4700:3030::6815:2001, 2606:4700:3030::6815:3001, 2606:4700:3030::6815:4001, 2606:4700:3030::6815:5001, 2606:4700:3030::6815:6001, 2606:4700:3030::6815:7001
Response IP 104.21.32.1
Found Yes
Hash 5574f979657b11b237a5ef86cb206f6ae6f6e907224b0561d0cd086f2aab601c
SimHash 51174a008196
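
The Hash value can be used to check whether a freshly fetched robots.txt still matches the scanned copy. The sketch below assumes the digest is SHA-256 over the raw response body; the report does not name the algorithm, and the 64 hex digits are merely consistent with SHA-256. The function names body_digest and matches_recorded are hypothetical helpers, not part of any scanner API.

```python
import hashlib

# Hash value recorded in the scan above.
RECORDED_HASH = "5574f979657b11b237a5ef86cb206f6ae6f6e907224b0561d0cd086f2aab601c"


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a response body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if the body hashes to the recorded value under the SHA-256 assumption."""
    return body_digest(body) == recorded.lower()
```

A mismatch would mean either that the file changed since the scan or that the scanner hashes something other than the raw body, so a negative result is not conclusive on its own.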

Groups

oai-searchbot
chatgpt-user
applebot
facebookexternalhit
peer39_crawler
criteobot
gptbot

Rule Path
Disallow /

perplexitybot
amazonbot
claudebot
omgilibot
facebookbot
anthropic-ai
bytespider
diffbot
imagesiftbot
omgili
youbot
ccbot
piplbot
senutobot
shortpixel
bytedance
meta-externalagent
petalbot
seznambot
mechanize
mj12bot

Rule Path
Disallow /
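
The two groups above can be checked with Python's standard urllib.robotparser. This is a minimal sketch: the robots.txt text is reconstructed from the agent names and rules in the scan, not copied from the live file, so line order, casing, and any directives the scan does not list are assumptions. The helpers build_robots_txt and is_allowed are hypothetical names.

```python
from urllib.robotparser import RobotFileParser

# Agent names copied from the two groups in the scan.
GROUP_1 = [
    "oai-searchbot", "chatgpt-user", "applebot", "facebookexternalhit",
    "peer39_crawler", "criteobot", "gptbot",
]
GROUP_2 = [
    "perplexitybot", "amazonbot", "claudebot", "omgilibot", "facebookbot",
    "anthropic-ai", "bytespider", "diffbot", "imagesiftbot", "omgili",
    "youbot", "ccbot", "piplbot", "senutobot", "shortpixel", "bytedance",
    "meta-externalagent", "petalbot", "seznambot", "mechanize", "mj12bot",
]


def build_robots_txt(groups):
    """Render each group as User-agent lines followed by 'Disallow: /'."""
    blocks = []
    for agents in groups:
        lines = [f"User-agent: {agent}" for agent in agents]
        lines.append("Disallow: /")
        blocks.append("\n".join(lines))
    return "\n\n".join(blocks) + "\n"


def is_allowed(agent, url):
    """Ask the parser whether `agent` may fetch `url` under the reconstruction."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt([GROUP_1, GROUP_2]).splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "ClaudeBot", "unlisted-agent"):
        print(agent, is_allowed(agent, "https://tinhnguyen.blog/"))
```

Agent matching in urllib.robotparser is case-insensitive, so "GPTBot" matches the "gptbot" entry. The reconstruction contains only the listed groups, so an agent that appears in neither group is not matched by any rule here.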

Comments

  • Block All Other Bots from Entire Site