simpleprogrammer.com
robots.txt

Robots Exclusion Standard data for simpleprogrammer.com

Resource Scan

Scan Details

Site Domain simpleprogrammer.com
Base Domain simpleprogrammer.com
Scan Status Ok
Last Scan2024-11-09T20:29:13+00:00
Next Scan 2024-11-16T20:29:13+00:00

Last Scan

Scanned2024-11-09T20:29:13+00:00
URL https://simpleprogrammer.com/robots.txt
Domain IPs 147.135.37.193
Response IP 147.135.37.193
Found Yes
Hash edcba41f5a30f613eff481da186d6476da82f4b5f4f1eb79702e9a8489629876
SimHash a3092573ddb1

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /category*
Disallow /tag/*
Disallow /author/*
Disallow /email
Disallow /lp/*
Disallow /ss-*
Disallow /yt/*
Disallow /20*
Disallow */embed/
Disallow */feed/
Disallow /privacy-policy/
Disallow /members/
Disallow /cg*
Disallow *lang%3D*
Disallow *orderby%3D*
Disallow *add-to-cart%3D*
Disallow *p%3D*
Disallow *post_type%3D*
Allow /2017-*

Other Records

Field Value
sitemap https://simpleprogrammer.com/sitemap_index.xml
sitemap https://simpleprogrammer.com/HTTP_Sitemap_SP.xml
sitemap https://simpleprogrammer.com/store/sitemap_index.xml

Comments

  • Removing excess pages which are noindexed and provide no value to Google
  • Removing some random URL parameters
  • Allow several posts with year in slug