archidiaries.com
robots.txt

Robots Exclusion Standard data for archidiaries.com

Resource Scan

Scan Details

Site Domain archidiaries.com
Base Domain archidiaries.com
Scan Status Ok
Last Scan 2026-01-06T19:34:04+00:00
Next Scan 2026-01-13T19:34:04+00:00

Last Scan

Scanned 2026-01-06T19:34:04+00:00
URL https://archidiaries.com/robots.txt
Domain IPs 2a02:4780:84:1c73:6857:634b:4194:c916, 2a02:4780:84:e3b1:3226:859e:aef4:4f32, 77.37.66.160, 93.127.201.222
Response IP 93.127.187.181
Found Yes
Hash c9351c597f89209a3ed2c96e66caaf22c4e166d0ae52a15b6ed615b4461c09b6
SimHash 39085a84a4f3

Groups

*

Rule Path
Disallow (empty)

facebookexternalhit

Rule Path
Allow /

facebot

Rule Path
Allow /

twitterbot

Rule Path
Allow /

linkedinbot

Rule Path
Allow /

pinterest

Rule Path
Allow /

*

Rule Path
Allow /*.jpg$
Allow /*.jpeg$
Allow /*.png$
Allow /*.gif$
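
The image rules above use the `*` wildcard and the `$` end-of-path anchor. Python's standard `urllib.robotparser` matches rule paths by prefix only, so it does not interpret these characters. The following is a minimal sketch of one way to evaluate such patterns by translating them to regular expressions. The helper names (`robots_pattern_to_regex`, `matches_image_rule`) and the sample paths are hypothetical and are not part of the scan data.

```python
import re

# The four Allow patterns listed in the second "*" group of the scan.
IMAGE_RULES = ["/*.jpg$", "/*.jpeg$", "/*.png$", "/*.gif$"]


def robots_pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the
    pattern to the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape the literal pieces and join them with '.*' for each wildcard.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + regex + ("$" if anchored else ""))


def matches_image_rule(path):
    """Return True if the path matches any of the image Allow patterns."""
    return any(robots_pattern_to_regex(rule).match(path) for rule in IMAGE_RULES)


# Hypothetical paths, used only for illustration.
for sample in ["/wp-content/uploads/photo.jpg", "/photo.jpg?size=large", "/about/"]:
    print(sample, matches_image_rule(sample))
```

Because of the `$` anchor, a path with a query string after the extension does not match these patterns in this sketch. Since the first `*` group has an empty Disallow, such a path would still not be blocked.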

Other Records

Field Value
sitemap https://archidiaries.com/sitemap_index.xml
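
As a rough illustration of how the groups and the sitemap record above could be checked programmatically, the sketch below feeds a reconstructed file to Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is an assumption rebuilt from this report: the original line order, blank lines, and comment placement are not known. `SomeOtherBot` and the sample URLs are hypothetical. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the scanned file from the reported groups;
# the real file's ordering and comments may differ.
ROBOTS_TXT = """\
User-agent: *
Disallow:

User-agent: facebookexternalhit
Allow: /

User-agent: facebot
Allow: /

User-agent: twitterbot
Allow: /

User-agent: linkedinbot
Allow: /

User-agent: pinterest
Allow: /

User-agent: *
Allow: /*.jpg$
Allow: /*.jpeg$
Allow: /*.png$
Allow: /*.gif$

Sitemap: https://archidiaries.com/sitemap_index.xml
"""


def build_parser(text):
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# A named social crawler and an arbitrary (hypothetical) crawler.
print(parser.can_fetch("facebookexternalhit", "https://archidiaries.com/"))
print(parser.can_fetch("SomeOtherBot", "https://archidiaries.com/any/page"))

# Sitemap URLs declared in the file.
print(parser.site_maps())
```

Under these assumptions both crawlers are permitted: the named groups allow `/`, and the empty Disallow in the first `*` group blocks nothing. Note that `urllib.robotparser` keeps only the first `*` group as its default entry, so the second `*` group with the image patterns plays no part in this check.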

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK
  • --- Allow social media crawlers ---
  • --- Extra: Allow image previews for sharing ---