numerama.com
robots.txt

Robots Exclusion Standard data for numerama.com

Resource Scan

Scan Details

Site Domain numerama.com
Base Domain numerama.com
Scan Status Ok
Last Scan2024-05-25T11:04:44+00:00
Next Scan 2024-06-01T11:04:44+00:00

Last Scan

Scanned2024-05-25T11:04:44+00:00
URL https://numerama.com/robots.txt
Redirect https://www.numerama.com/robots.txt
Redirect Domain www.numerama.com
Redirect Base numerama.com
Domain IPs 104.26.14.117, 104.26.15.117, 172.67.73.247, 2606:4700:20::681a:e75, 2606:4700:20::681a:f75, 2606:4700:20::ac43:49f7
Redirect IPs 104.26.14.117, 104.26.15.117, 172.67.73.247, 2606:4700:20::681a:e75, 2606:4700:20::681a:f75, 2606:4700:20::ac43:49f7
Response IP 104.26.14.117
Found Yes
Hash a875f835107d15ecd4e6a104f96b40d54c321c8c1b1ee5b1d464be9c7e35f06c
SimHash 6a0d9818af91

Groups

*

Rule Path
Disallow /search/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /numeramaplus/refresh/
Disallow /wp-content/themes/numerama-next/misc/

gptbot

Rule Path
Disallow /
Disallow /ajax-postviews.php

Other Records

Field Value
sitemap https://www.numerama.com/sitemap_index.xml
sitemap https://www.numerama.com/news-sitemap.xml
sitemap https://www.numerama.com/telecharger/telechargements.xml.gz

Comments

  • TODO: remove after postViews deletion (24/03/2020)