trefamiglia.com
robots.txt

Robots Exclusion Standard data for trefamiglia.com

Resource Scan

Scan Details

Site Domain trefamiglia.com
Base Domain trefamiglia.com
Scan Status Ok
Last Scan2026-01-05T02:44:09+00:00
Next Scan 2026-02-04T02:44:09+00:00

Last Scan

Scanned2026-01-05T02:44:09+00:00
URL https://www.trefamiglia.com/robots.txt
Domain IPs 35.212.40.238
Response IP 35.212.40.238
Found Yes
Hash 348563d69eb16e8eaa45faf554abc9c246ed4bf3e97f4d171a2ed2909417603e
SimHash a95d557025d9

Groups

*

Rule Path
Disallow /administrator/
Disallow /cache/
Disallow /components/
Disallow /images/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /libraries/
Disallow /media/
Disallow /modules/
Disallow /plugins/
Disallow /templates/
Disallow /tmp/
Disallow /xmlrpc/

Other Records

Field Value
sitemap http://cdn.attracta.com/sitemap/341218.xml.gz
sitemap http://cdn.attracta.com/sitemap/1302198.xml.gz

Comments

  • Begin Attracta SEO Tools Sitemap. Do not remove
  • End Attracta SEO Tools Sitemap. Do not remove