trefamiglia.com
robots.txt

Robots Exclusion Standard data for trefamiglia.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	trefamiglia.com
Base Domain	trefamiglia.com
Scan Status	Ok
Last Scan	2026-01-05T02:44:09+00:00
Next Scan	2026-02-04T02:44:09+00:00

Last Scan

Scanned	2026-01-05T02:44:09+00:00
URL	https://www.trefamiglia.com/robots.txt
Domain IPs	35.212.40.238
Response IP	35.212.40.238
Found	Yes
Hash	348563d69eb16e8eaa45faf554abc9c246ed4bf3e97f4d171a2ed2909417603e
SimHash	a95d557025d9

Groups

*

Rule	Path
Disallow	/administrator/
Disallow	/cache/
Disallow	/components/
Disallow	/images/
Disallow	/includes/
Disallow	/installation/
Disallow	/language/
Disallow	/libraries/
Disallow	/media/
Disallow	/modules/
Disallow	/plugins/
Disallow	/templates/
Disallow	/tmp/
Disallow	/xmlrpc/

Rule

Path

Disallow

/administrator/

Disallow

/cache/

Disallow

/components/

Disallow

/images/

Disallow

/includes/

Disallow

/installation/

Disallow

/language/

Disallow

/libraries/

Disallow

/media/

Disallow

/modules/

Disallow

/plugins/

Disallow

/templates/

Disallow

/tmp/

Disallow

/xmlrpc/

Back to top

Other Records

Field	Value
sitemap	http://cdn.attracta.com/sitemap/341218.xml.gz
sitemap	http://cdn.attracta.com/sitemap/1302198.xml.gz

Field

Value

sitemap

http://cdn.attracta.com/sitemap/341218.xml.gz

sitemap

http://cdn.attracta.com/sitemap/1302198.xml.gz

Back to top

Comments

Begin Attracta SEO Tools Sitemap. Do not remove
End Attracta SEO Tools Sitemap. Do not remove

Back to top

trefamiglia.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

Comments

trefamiglia.com
robots.txt