trashik.news
robots.txt

Robots Exclusion Standard data for trashik.news

Resource Scan

Scan Details

Site Domain trashik.news
Base Domain trashik.news
Scan Status Ok
Last Scan 2024-06-04T21:02:03+00:00
Next Scan 2024-06-11T21:02:03+00:00

Last Scan

Scanned 2024-06-04T21:02:03+00:00
URL http://trashik.news/robots.txt
Redirect https://newsmax.in.ua/robots.txt
Redirect Domain newsmax.in.ua
Redirect Base newsmax.in.ua
Redirect IPs 109.94.209.214
Response IP 109.94.209.214
Found Yes
Hash bdf9c341c55bf7971a925fc3972128e809da9315960664df92d9a0b050ca53ef
SimHash 477500b246d0

Groups

petalbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

crowdtanglebot

Rule Path
Disallow /

*

Rule Path
Allow /wp-includes/js/mediaelement/*.js
Allow /wp-includes/js/mediaelement/*.css
Allow /wp-includes/css/dist/block-library/*.css
Allow /wp-content/cache/wmac/css/*.css
Allow /wp-content/themes/*.css
Allow /wp-content/themes/*.js
Allow /wp-content/themes/*.png
Allow /wp-content/themes/*.gif
Allow /wp-content/themes/*.woff
Allow /wp-content/themes/*.ttf
Allow /wp-content/plugins/*.css
Allow /wp-content/plugins/*.js
Allow /wp-content/plugins/*.png
Allow /wp-content/plugins/*.gif
Allow /wp-includes/*.js
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-content/themes
Disallow /wp-trackback
Disallow /wp-feed
Disallow /wp-comments
Disallow /?id=*
Disallow */trackback
Disallow */feed
Disallow */comments
Disallow */page/*
Disallow /?ajax-request=*
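
How these groups are typically evaluated can be sketched in a few lines of Python. This is a minimal illustration, not the scanner's or any crawler's actual implementation. It assumes the common RFC 9309 behaviour: a crawler uses the group named after its product token and otherwise falls back to `*`; within a group the longest matching pattern wins, with Allow winning ties; `*` matches any run of characters and a trailing `$` anchors the end. The names `GROUPS`, `rules_for`, `is_allowed`, and the crawler token `examplebot` are hypothetical.

```python
import re

# Rules of the "*" group, copied from the listing above.
STAR_RULES = [
    ("allow", "/wp-includes/js/mediaelement/*.js"),
    ("allow", "/wp-includes/js/mediaelement/*.css"),
    ("allow", "/wp-includes/css/dist/block-library/*.css"),
    ("allow", "/wp-content/cache/wmac/css/*.css"),
    ("allow", "/wp-content/themes/*.css"),
    ("allow", "/wp-content/themes/*.js"),
    ("allow", "/wp-content/themes/*.png"),
    ("allow", "/wp-content/themes/*.gif"),
    ("allow", "/wp-content/themes/*.woff"),
    ("allow", "/wp-content/themes/*.ttf"),
    ("allow", "/wp-content/plugins/*.css"),
    ("allow", "/wp-content/plugins/*.js"),
    ("allow", "/wp-content/plugins/*.png"),
    ("allow", "/wp-content/plugins/*.gif"),
    ("allow", "/wp-includes/*.js"),
    ("disallow", "/cgi-bin"),
    ("disallow", "/wp-admin"),
    ("disallow", "/wp-includes"),
    ("disallow", "/wp-content/plugins"),
    ("disallow", "/wp-content/cache"),
    ("disallow", "/wp-content/themes"),
    ("disallow", "/wp-trackback"),
    ("disallow", "/wp-feed"),
    ("disallow", "/wp-comments"),
    ("disallow", "/?id=*"),
    ("disallow", "*/trackback"),
    ("disallow", "*/feed"),
    ("disallow", "*/comments"),
    ("disallow", "*/page/*"),
    ("disallow", "/?ajax-request=*"),
]

# The four named crawlers are blocked entirely; everyone else gets "*".
GROUPS = {
    "petalbot": [("disallow", "/")],
    "bingbot": [("disallow", "/")],
    "linkpadbot": [("disallow", "/")],
    "crowdtanglebot": [("disallow", "/")],
    "*": STAR_RULES,
}


def rules_for(agent_token: str) -> list[tuple[str, str]]:
    """Pick the group for a product token (case-insensitive), else "*"."""
    return GROUPS.get(agent_token.lower(), GROUPS["*"])


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex matched from the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(agent_token: str, path: str) -> bool:
    """Longest matching pattern decides; Allow wins a tie; no match means allowed."""
    best_len, allowed = -1, True
    for kind, pattern in rules_for(agent_token):
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed
```

Under these assumptions, `is_allowed("bingbot", "/")` is false, while a generic crawler may fetch `/wp-content/themes/demo/style.css` (the Allow pattern is longer than `Disallow /wp-content/themes`) but not `/wp-admin/` or any path containing `/page/`. Real crawlers may differ in details such as percent-encoding and how the product token is extracted from the User-Agent header.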

Other Records

Field Value
crawl-delay 10
sitemap https://newsmax.in.ua/sitemap_index.xml
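
The `crawl-delay 10` record is not part of RFC 9309, and crawlers that honour it usually read it as a minimum number of seconds between requests. A minimal sketch of such pacing follows; the class name `CrawlDelayThrottle` and its injectable `clock` and `sleep` parameters are hypothetical and exist only to make the example testable.

```python
import time


class CrawlDelayThrottle:
    """Enforce a minimum gap between consecutive requests to one host."""

    def __init__(self, delay_seconds, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay_seconds
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous request, if any

    def wait(self) -> float:
        """Sleep until the gap has elapsed; return the seconds slept."""
        now = self._clock()
        pause = 0.0
        if self._last is not None:
            pause = max(0.0, self.delay - (now - self._last))
            if pause:
                self._sleep(pause)
        self._last = now + pause
        return pause


# Usage: call wait() before each fetch from the host.
# throttle = CrawlDelayThrottle(10)  # value taken from the record above
# throttle.wait()
```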

Warnings

  • `host` is not a known field.
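
A warning of this kind can be produced by checking each field name against a list of recognised ones. The sketch below is an assumption about how such a check might work, not the scanner's actual logic: the `KNOWN_FIELDS` set is a guess, and `SAMPLE` is a hypothetical file using `example.com`, not the content fetched from this site.

```python
# Assumed set of recognised fields; the scanner's real list is not shown in the report.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def unknown_fields(robots_txt: str) -> list[str]:
    """Return unrecognised field names in order of first appearance."""
    seen = []
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in seen:
            seen.append(field)
    return seen


# Hypothetical input for illustration only.
SAMPLE = """\
User-agent: *
Disallow: /wp-admin
Crawl-delay: 10
Host: example.com
Sitemap: https://example.com/sitemap.xml
"""

for name in unknown_fields(SAMPLE):
    print(f"`{name}` is not a known field.")
```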