nancy.cc
robots.txt

Robots Exclusion Standard data for nancy.cc

Resource Scan

Scan Details

Site Domain nancy.cc
Base Domain nancy.cc
Scan Status Ok
Last Scan2024-09-21T11:13:59+00:00
Next Scan 2024-09-28T11:13:59+00:00

Last Scan

Scanned2024-09-21T11:13:59+00:00
URL https://nancy.cc/robots.txt
Domain IPs 160.153.42.136
Response IP 160.153.42.136
Found Yes
Hash bb47588769d12095b0c3a92d97014faab88ef3e9e4268571618bfe5c25ceb118
SimHash 4155ae12e217

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-content/themes
Disallow /trackback
Disallow /feed
Disallow /comments
Disallow */trackback
Disallow */feed
Disallow */comments
Disallow /*?*
Disallow /*?
Allow /wp-content/uploads

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow
Allow /*

duggmirror

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mozilla/5.0 (compatible;contxbot/1.0)

Rule Path
Disallow

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.nancy.cc/sitemap.xml

Comments

  • Google Image
  • Google AdSense
  • digg mirror
  • AhrefsBot
  • Amazon Associates
  • GPTBot