sm.news
robots.txt

Robots Exclusion Standard data for sm.news

Resource Scan

Scan Details

Site Domain sm.news
Base Domain sm.news
Scan Status Ok
Last Scan 2024-09-28T10:36:53+00:00
Next Scan 2024-10-05T10:36:53+00:00

Last Scan

Scanned 2024-09-28T10:36:53+00:00
URL https://sm.news/robots.txt
Domain IPs 5.188.119.182
Response IP 5.188.119.182
Found Yes
Hash 251890fa10089061ae5b9f51f25b255608ee76133b44601484601b8e5f2295a4
SimHash 220d9ee0cb90

Groups

*

Rule Path
Disallow /xmlrpc.php
Disallow /cgi-bin
Disallow /wp-json
Disallow /wp-admin
Disallow /wp-content/cache
Disallow /author/*
Disallow /*/page/*
Disallow /*nocache%3D1*
Disallow /*?*
Disallow /*/feed/
Disallow /*/feed
Allow /*/feed.php
Allow /wp-includes/*.css?*
Allow /wp-includes/*.js?*
Allow /wp-content/plugins/*.css?*
Allow /wp-content/plugins/*.js?*
Allow /wp-content/themes/*.css?*
Allow /wp-content/themes/*.js?*
Allow /sitemap_article.php?*
Allow /sitemap_archive.php?*
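
The Allow and Disallow rules in the `*` group overlap (for example, `/*?*` blocks every URL with a query string, while `/wp-includes/*.js?*` re-allows some of them). The following is a minimal sketch of how a crawler might resolve such overlaps, assuming the longest-match precedence described in RFC 9309, with Allow winning ties. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of the scan data, and the sketch compares raw path strings without normalizing percent-encoding.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/cgi-bin"),
    ("disallow", "/wp-json"),
    ("disallow", "/wp-admin"),
    ("disallow", "/wp-content/cache"),
    ("disallow", "/author/*"),
    ("disallow", "/*/page/*"),
    ("disallow", "/*nocache%3D1*"),
    ("disallow", "/*?*"),
    ("disallow", "/*/feed/"),
    ("disallow", "/*/feed"),
    ("allow", "/*/feed.php"),
    ("allow", "/wp-includes/*.css?*"),
    ("allow", "/wp-includes/*.js?*"),
    ("allow", "/wp-content/plugins/*.css?*"),
    ("allow", "/wp-content/plugins/*.js?*"),
    ("allow", "/wp-content/themes/*.css?*"),
    ("allow", "/wp-content/themes/*.js?*"),
    ("allow", "/sitemap_article.php?*"),
    ("allow", "/sitemap_archive.php?*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is treated as a literal prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be crawled.

    Assumption: the longest matching pattern wins, and Allow wins a tie.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and directive == "allow"):
                best_len = length
                allowed = directive == "allow"
    return allowed
```

Under these assumptions, `is_allowed("/wp-admin/index.php")` is false, while `is_allowed("/wp-includes/js/jquery.js?ver=3.7")` is true because the longer Allow pattern outranks `/*?*`.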

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /
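
The three named groups above each carry a single `Disallow /` rule, so those crawlers are excluded from the whole site, while every other crawler falls back to the `*` group. The following is a minimal sketch of that group selection, assuming case-insensitive matching of the crawler's product token as described in RFC 9309. The names `NAMED_GROUPS`, `select_group`, and `is_fully_blocked` are hypothetical.

```python
# Named groups from the scan above; each one contains only "Disallow /".
NAMED_GROUPS = {"semrushbot", "ahrefsbot", "mj12bot"}


def select_group(product_token):
    """Return the group name that applies to a crawler's product token.

    Assumption: tokens are compared case-insensitively, and a crawler
    without its own group uses the "*" group.
    """
    token = product_token.lower()
    return token if token in NAMED_GROUPS else "*"


def is_fully_blocked(product_token):
    """True when the crawler's group disallows the entire site."""
    return select_group(product_token) in NAMED_GROUPS
```

For example, `select_group("AhrefsBot")` selects the `ahrefsbot` group, and a crawler with no named group, such as one identifying as `Googlebot`, is evaluated against the `*` rules instead.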

Other Records

Field Value
sitemap https://sm.news/sitemap_index.php