carnediem.blog
robots.txt

Robots Exclusion Standard data for carnediem.blog

Resource Scan

Scan Details

Site Domain carnediem.blog
Base Domain carnediem.blog
Scan Status Ok
Last Scan 2024-10-22T10:48:18+00:00
Next Scan 2024-10-29T10:48:18+00:00

Last Scan

Scanned 2024-10-22T10:48:18+00:00
URL https://carnediem.blog/robots.txt
Domain IPs 104.21.61.202, 172.67.214.113, 2606:4700:3033::ac43:d671, 2606:4700:3037::6815:3dca
Response IP 104.21.61.202
Found Yes
Hash 12ae408d39ffbb728bf7acc4ebee292a816ffc150e2fd063037049ab9c29b0fc
SimHash 0a6c6c80cd16

Groups

*

Rule Path
Disallow /*blackhole
Disallow /?blackhole
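
The two paths above rely on robots.txt pattern syntax: `*` stands for any run of characters, while `?` is an ordinary literal character. A minimal sketch of that matching, assuming the wildcard rules described in RFC 9309 (`*` as wildcard, a trailing `$` as end anchor, otherwise prefix matching), could look like this. The helper name `rule_matches` is hypothetical, and percent-encoding normalization is deliberately left out.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches a URL path (including query)."""
    # A trailing '$' anchors the pattern to the end of the path.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape every literal piece, then join the pieces with '.*' for each '*'.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += r"\Z"
    # re.match anchors at the start only, which gives prefix semantics.
    return re.match(regex, path, flags=re.DOTALL) is not None


if __name__ == "__main__":
    for candidate in ("/wp-content/blackhole/", "/?blackhole=1", "/about/"):
        print(candidate, rule_matches("/*blackhole", candidate))
```

Under these assumptions `/*blackhole` also covers any URL matched by `/?blackhole`, since `*` can match the `?` character, so the second rule is redundant but harmless.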

amazon-kendra-web-crawler-*

Product Comment
amazon-kendra-web-crawler-* all customers of Amazon Kendra's web crawler
Rule Path Comment
Disallow / disallow everything

*

Rule Path
Disallow (empty)
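
The report lists two separate `*` groups, the second with an empty Disallow path. As a rough sketch of how a crawler following RFC 9309 might read this, the snippet below combines the rules of all groups for the same user agent and skips empty patterns, which match nothing. The `GROUPS` structure and the `effective_rules` helper are hypothetical names used for illustration, and comparing the agent line by simple equality is a simplification of real product-token matching.

```python
# Groups as listed in the scan report: (user-agent line, [(rule, path), ...]).
GROUPS = [
    ("*", [("Disallow", "/*blackhole"), ("Disallow", "/?blackhole")]),
    ("amazon-kendra-web-crawler-*", [("Disallow", "/")]),
    ("*", [("Disallow", "")]),
]


def effective_rules(groups, agent):
    """Combine the rules of every group whose user-agent line equals `agent`."""
    merged = []
    for group_agent, rules in groups:
        # Simplified, case-insensitive comparison of the user-agent line.
        if group_agent.lower() != agent.lower():
            continue
        for rule, path in rules:
            if path == "":
                # An empty pattern matches no URL, so it adds no restriction.
                continue
            merged.append((rule, path))
    return merged


if __name__ == "__main__":
    print(effective_rules(GROUPS, "*"))
    print(effective_rules(GROUPS, "amazon-kendra-web-crawler-*"))
```

Read this way, generic crawlers are kept out of the blackhole paths only, while the Amazon Kendra group is blocked from the entire site.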

Other Records

Field Value
sitemap https://carnediem.blog/sitemap.xml
sitemap https://carnediem.blog/news-sitemap.xml
sitemap https://carnediem.blog/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK