munsifdaily.com
robots.txt

Robots Exclusion Standard data for munsifdaily.com

Resource Scan

Scan Details

Site Domain munsifdaily.com
Base Domain munsifdaily.com
Scan Status Ok
Last Scan2024-10-30T02:23:15+00:00
Next Scan 2024-11-06T02:23:15+00:00

Last Scan

Scanned2024-10-30T02:23:15+00:00
URL https://munsifdaily.com/robots.txt
Domain IPs 104.21.56.166, 172.67.187.45, 2606:4700:3032::ac43:bb2d, 2606:4700:3033::6815:38a6
Response IP 104.21.56.166
Found Yes
Hash 4195b5d7b568a14124e989ad158f4b1eb5152ce8a1184851d98cd517878ca370
SimHash 680c59536ef9

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /*/feed/
Disallow /*/?expand_article=1
Disallow /*/?amp=1
Disallow /*/?noamp=mobile
Disallow /?p=*
Disallow /tag/
Disallow /live_update/
Disallow /wp-login.php
Disallow /wp-register.php
Disallow /cgi-bin/
Disallow /comments/feed/
Disallow /trackback/
Disallow /index.php
Disallow /xmlrpc.php
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /readme.html
Disallow /license.txt
Allow /

Other Records

Field Value
sitemap https://munsifdaily.com/sitemaps.xml

Comments

  • Robots.txt for WordPress
  • Disallow URLs containing
  • Disallow tag archives
  • Disallow custom post type
  • Sitemap: specify the path to your sitemap (if you have one)
  • Prevent crawlers from indexing pages that are likely to be of little value to them
  • Block access to specific directories and files that are not meant for crawling
  • Allow crawling of all content