loradicarlo.com
robots.txt

Robots Exclusion Standard data for loradicarlo.com

Resource Scan

Scan Details

Site Domain loradicarlo.com
Base Domain loradicarlo.com
Scan Status Ok
Last Scan4/14/2025, 2:58:26 AM
Next Scan 5/14/2025, 2:58:26 AM

Last Scan

Scanned4/14/2025, 2:58:26 AM
URL https://loradicarlo.com/robots.txt
Redirect https://lebanondailyrecord.com/robots.txt
Redirect Domain lebanondailyrecord.com
Redirect Base lebanondailyrecord.com
Domain IPs 104.21.43.104, 172.67.178.34, 2606:4700:3033::6815:2b68, 2606:4700:3036::ac43:b222
Redirect IPs 104.26.2.234, 104.26.3.234, 172.67.72.19, 2606:4700:20::681a:2ea, 2606:4700:20::681a:3ea, 2606:4700:20::ac43:4813
Response IP 172.67.72.19
Found Yes
Hash 281a23a745123821c4c292321600d1b27e392060360d3b92d2edcc6578fd984f
SimHash 6949d804cb22

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /author/
Disallow /*/trackback
Disallow /img/
Disallow /tag/
Disallow /feed
Disallow /*/feed
Disallow /comments/feed
Disallow /?s=*
Disallow /attachment/

facebookexternalhit

Rule Path
Allow /

Other Records

Field Value
sitemap https://lebanondailyrecord.com/sitemap.xml
sitemap https://xoilacyyx.cc/sitemap.xml

Comments

  • Allow Facebook scraper