gfacct.org
robots.txt

Robots Exclusion Standard data for gfacct.org

Resource Scan

Scan Details

Site Domain gfacct.org
Base Domain gfacct.org
Scan Status Ok
Last Scan2025-10-12T23:11:15+00:00
Next Scan 2025-11-11T23:11:15+00:00

Last Scan

Scanned2025-10-12T23:11:15+00:00
URL https://gfacct.org/robots.txt
Redirect https://www.gfacct.org/robots.txt
Redirect Domain www.gfacct.org
Redirect Base gfacct.org
Domain IPs 104.21.4.44, 172.67.131.166, 2606:4700:3030::6815:42c, 2606:4700:3037::ac43:83a6
Redirect IPs 104.21.4.44, 172.67.131.166, 2606:4700:3030::6815:42c, 2606:4700:3037::ac43:83a6
Response IP 104.21.4.44
Found Yes
Hash d0fccf67a4ebdfaf27e640ceb75e6a910e8286e2052484056ba68f156a4bc7ab
SimHash 634449664230

Groups

*

Rule Path
Disallow /wp-login.php
Disallow /wp-login.php?*
Disallow /wp-register.php
Disallow /xmlrpc.php
Disallow */?*
Disallow */trackback/
Disallow */feed
Disallow */comments/
Disallow */comment
Disallow */attachment/*
Disallow */print/
Disallow *?print=*
Disallow */product/*
Disallow */product_table/*
Disallow */tdb_templates/*
Disallow */tag/*
Disallow */author/*
Allow /wp-content/themes/*.css
Allow /wp-content/plugins/*.css
Allow /wp-content/uploads/*.css
Allow /wp-content/themes/*.js
Allow /wp-content/plugins/*.js
Allow /wp-content/uploads/*.js
Allow /wp-includes/css/
Allow /wp-includes/js/
Allow /wp-includes/images/
Allow /wp-content/uploads/
Allow /wp-admin/admin-ajax.php

googlebot-image

Rule Path
Allow /wp-content/uploads/

mediapartners-google

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.gfacct.org/sitemap_index.xml