newstodayintheworld.com
robots.txt

Robots Exclusion Standard data for newstodayintheworld.com

Resource Scan

Scan Details

Site Domain newstodayintheworld.com
Base Domain newstodayintheworld.com
Scan Status Ok
Last Scan2024-11-15T10:26:50+00:00
Next Scan 2024-11-22T10:26:50+00:00

Last Scan

Scanned2024-11-15T10:26:50+00:00
URL https://newstodayintheworld.com/robots.txt
Domain IPs 213.159.214.120
Response IP 213.159.214.120
Found Yes
Hash 725800cbd4642dc423f56e9a42493b99fc61b3746359d8961aca8a300c506932
SimHash 6b609a67aea0

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-content/themes
Disallow /trackback
Disallow /feed
Disallow /feeds
Disallow /wprss
Disallow /comments
Disallow /category/*/*
Disallow */trackback
Disallow */feed
Disallow */feeds
Disallow */wprss
Disallow */comments
Disallow /*?*
Disallow /feed/
Disallow /feeds/
Disallow /wprss/
Disallow /archives/
Disallow /category/
Disallow /search/
Disallow /page/
Disallow */page/
Disallow */*/page/
Disallow */page
Disallow */*/page
Disallow /tag/
Disallow /xmlrpc.php
Allow /wp-content/uploads
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://newstodayintheworld.com/sitemap.xml

Warnings

  • `host` is not a known field.