f1i.com
robots.txt

Robots Exclusion Standard data for f1i.com

Resource Scan

Scan Details

Site Domain f1i.com
Base Domain f1i.com
Scan Status Ok
Last Scan2024-04-27T20:19:06+00:00
Next Scan 2024-05-04T20:19:06+00:00

Last Scan

Scanned2024-04-27T20:19:06+00:00
URL https://f1i.com/robots.txt
Domain IPs 51.91.51.74
Response IP 51.91.51.74
Found Yes
Hash a8539cc58f6124fa2448408d884a447ee6de0f8127cf50ead780247baac1d2dc
SimHash c744ce144620

Groups

nutch

Rule Path
Disallow /

spiderbot

Rule Path
Disallow /

spiderbot/nutch-1.7

Rule Path
Disallow /

*

Rule Path
Disallow /*?
Disallow /wp-login.php
Disallow /wp-admin
Disallow /wp-includes
Allow /wp-content/uploads
Disallow /wp-www
Allow /wp-www/css
Allow /wp-www/js
Disallow */trackback
Disallow /*/feed
Disallow /*/comments
Disallow /cgi-bin
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz
Disallow /*.cgi
Allow /*.css$
Allow /*.js$
Allow *.js
Allow *.css
Allow ads.txt
Allow sitemap_index.xml

googlebot-image

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow