nufcblog.com
robots.txt

Robots Exclusion Standard data for nufcblog.com

Resource Scan

Scan Details

Site Domain nufcblog.com
Base Domain nufcblog.com
Scan Status Ok
Last Scan2024-10-05T06:59:51+00:00
Next Scan 2024-10-12T06:59:51+00:00

Last Scan

Scanned2024-10-05T06:59:51+00:00
URL https://nufcblog.com/robots.txt
Domain IPs 2604:4f00:10:510a:0:20:746:1, 74.114.91.194
Response IP 74.114.91.194
Found Yes
Hash ffd49babe95d958e5df8082e04338c7fb6c95ae692198d75e55c69f6107166f6
SimHash a4429850c561

Groups

*

Rule Path
Allow /

Other Records

Field Value
sitemap http://www.nufcblog.com/sitemap.xml.gz

Comments

  • Mediapartners-Google moved to the top (and Disallow: /wp-includes/ not used) as per ticket 1569922
  • The following 2 lines were disabled by Tiger Technologies - see ticket 1826975
  • User-agent: Mediapartners-Google
  • Disallow: /
  • Prior location for listing Mediapartners-Google
  • Disallow:/wp-includes/
  • Disallow:/?p=*</em>
  • BEGIN XML-SITEMAP-PLUGIN
  • END XML-SITEMAP-PLUGIN