perniknews.com
robots.txt

Robots Exclusion Standard data for perniknews.com

Resource Scan

Scan Details

Site Domain perniknews.com
Base Domain perniknews.com
Scan Status Ok
Last Scan 4/10/2025, 10:19:40 PM
Next Scan 4/17/2025, 10:19:40 PM

Last Scan

Scanned 4/10/2025, 10:19:40 PM
URL https://perniknews.com/robots.txt
Domain IPs 104.21.51.76, 172.67.177.65, 2606:4700:3030::ac43:b141, 2606:4700:3037::6815:334c
Response IP 104.21.51.76
Found Yes
Hash f382f9d8668dbc2a56b60fc832b45c4088f9d5908f00dcb5a4473ed72c51cf2c
SimHash 0941b333ce9a

Groups

*

Rule Path
Allow /
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content
Disallow /wp-login.php
Disallow /trackback
Disallow /comments
Disallow */trackback
Disallow */comments
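
The rules in the * group can be checked against a URL path with a small matcher. The following is a minimal sketch, assuming the longest-match evaluation described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie); the scan itself does not state which evaluation a given crawler uses. The names pattern_to_regex, is_allowed and STAR_GROUP are hypothetical and introduced only for illustration.

```python
import re

# Rules of the "*" group, copied from the table above.
STAR_GROUP = [
    ("allow", "/"),
    ("disallow", "/cgi-bin"),
    ("disallow", "/wp-admin"),
    ("disallow", "/wp-includes"),
    ("disallow", "/wp-content"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/trackback"),
    ("disallow", "/comments"),
    ("disallow", "*/trackback"),
    ("disallow", "*/comments"),
]


def pattern_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters, a trailing '$' anchors the end,
    and everything else is treated as a literal prefix.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, url_path):
    """Return True if url_path may be fetched under the given rules.

    Assumption: longest matching pattern wins; on equal length, allow wins.
    A path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, path in rules:
        if pattern_to_regex(path).match(url_path):
            allow = kind == "allow"
            if len(path) > best_len or (len(path) == best_len and allow):
                best_len = len(path)
                best_allow = allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/news/article", "/wp-admin/options.php",
              "/2024/some-post/trackback"]:
        print(p, is_allowed(STAR_GROUP, p))
```

Under this evaluation the leading Allow / does not override the more specific Disallow lines. A parser that applies rules in file order and stops at the first match would instead treat every path as allowed, so the outcome depends on the crawler.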

screaming frog seo spider

Rule Path
Allow /
Disallow /*?*

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

ia_archiver-web.archive.org

Rule Path
Disallow /
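
A crawler applies only one of the groups listed above: the one named for its user agent, or the * group when none matches. The sketch below illustrates that selection under a simplifying assumption, namely an exact, case-insensitive comparison of the whole user-agent name against the group name; real crawlers may match on a product token or substring instead. GROUPS and select_group are hypothetical names, and the * group is abbreviated to three rules for brevity.

```python
# Groups from the scan, keyed by lower-cased user-agent name.
# The "*" group is abbreviated here; the full list appears in the table above.
GROUPS = {
    "*": [("allow", "/"), ("disallow", "/cgi-bin"), ("disallow", "/wp-admin")],
    "screaming frog seo spider": [("allow", "/"), ("disallow", "/*?*")],
    "mediapartners-google": [("allow", "/")],
    "adsbot-google": [("allow", "/")],
    "googlebot-image": [("allow", "/")],
    "googlebot-mobile": [("allow", "/")],
    "ia_archiver-web.archive.org": [("disallow", "/")],
}


def select_group(groups, user_agent):
    """Pick the rule list for user_agent.

    Assumption: exact, case-insensitive name match; otherwise fall back to
    the "*" group, and to no rules at all if there is no "*" group.
    """
    return groups.get(user_agent.lower(), groups.get("*", []))


if __name__ == "__main__":
    for agent in ["Googlebot-Image", "ia_archiver-web.archive.org", "SomeBot"]:
        print(agent, select_group(GROUPS, agent))
```

Because a named group replaces the * group rather than adding to it, an agent such as googlebot-image is governed only by its own Allow / and is not subject to the /wp-admin or /trackback restrictions.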

Other Records

Field Value
sitemap http://domain.com/sitemap.xml
sitemap https://perniknews.com/sitemap_index.xml

Comments

  • disallow all files in these directories
  • disallow all files with ? in url