osmelhoresdoces.com
robots.txt

Robots Exclusion Standard data for osmelhoresdoces.com

Resource Scan

Scan Details

Site Domain osmelhoresdoces.com
Base Domain osmelhoresdoces.com
Scan Status Ok
Last Scan 2024-11-04T22:16:48+00:00
Next Scan 2024-11-11T22:16:48+00:00

Last Scan

Scanned 2024-11-04T22:16:48+00:00
URL https://osmelhoresdoces.com/robots.txt
Domain IPs 104.21.11.75, 172.67.148.136, 2606:4700:3032::6815:b4b, 2606:4700:3035::ac43:9488
Response IP 104.21.11.75
Found Yes
Hash dd9078da4fcb739422364b373aa51902a3ba88a422d5224b854239eb2b89fdf4
SimHash c800d09407f3

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-content/uploads/

archive.org_bot

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

lusa.pt

Rule Path
Disallow /

fairlicensing.com

Rule Path
Disallow /

arquivo.pt

Rule Path
Disallow /

twitterbot

Rule Path
Allow /wp-content/uploads/

facebookexternalhit

Rule Path
Allow /

arquivo-web-crawler

Rule Path
Disallow /
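
The groups above can be read as a lookup from user-agent token to an ordered list of rules. The following is a minimal sketch of how a crawler might evaluate them, assuming the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). The names GROUPS and is_allowed are hypothetical and introduced only for illustration. The sketch matches the user-agent token exactly (case-insensitively) and does not handle the `*` and `$` path wildcards, which none of the rules listed here use.

```python
# Groups as listed in the scan above: user-agent token -> ordered (rule, path) pairs.
# GROUPS and is_allowed are illustrative names, not part of any library.
GROUPS = {
    "*": [
        ("disallow", "/wp-admin/"),
        ("allow", "/wp-admin/admin-ajax.php"),
        ("disallow", "/wp-content/uploads/"),
    ],
    "archive.org_bot": [("disallow", "/")],
    "googlebot-image": [("disallow", "/")],
    "lusa.pt": [("disallow", "/")],
    "fairlicensing.com": [("disallow", "/")],
    "arquivo.pt": [("disallow", "/")],
    "twitterbot": [("allow", "/wp-content/uploads/")],
    "facebookexternalhit": [("allow", "/")],
    "arquivo-web-crawler": [("disallow", "/")],
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Simplified longest-match evaluation (assumption: RFC 9309 precedence)."""
    # Use the group named for this agent, falling back to the "*" group.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for rule, prefix in rules:
        if not path.startswith(prefix):
            continue
        is_allow = rule == "allow"
        # A longer prefix wins; on equal length, allow wins over disallow.
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len, allowed = len(prefix), is_allow
    return allowed


if __name__ == "__main__":
    for agent, path in [
        ("somebot", "/wp-admin/admin-ajax.php"),
        ("somebot", "/wp-content/uploads/photo.jpg"),
        ("twitterbot", "/wp-content/uploads/photo.jpg"),
        ("googlebot-image", "/"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, a crawler with its own group (such as twitterbot) does not inherit the `*` rules, so the twitterbot group, which contains only an Allow line, leaves the whole site open to that agent. Parsers that apply rules in file order rather than by longest match may reach a different verdict for /wp-admin/admin-ajax.php.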

Other Records

Field Value
sitemap https://osmelhoresdoces.com/sitemap.xml