topmags.com
robots.txt

Robots Exclusion Standard data for topmags.com

Resource Scan

Scan Details

Site Domain topmags.com
Base Domain topmags.com
Scan Status Ok
Last Scan 2024-09-28T10:19:48+00:00
Next Scan 2024-10-05T10:19:48+00:00

Last Scan

Scanned 2024-09-28T10:19:48+00:00
URL https://topmags.com/robots.txt
Redirect https://www.topmags.com:443/robots.txt
Redirect Domain www.topmags.com
Redirect Base topmags.com
Domain IPs 100.29.81.51, 18.214.190.13
Redirect IPs 100.29.81.51, 18.214.190.13
Response IP 100.29.81.51
Found Yes
Hash 2deb40acc1afd9711c0913fee97652480276b6e428f9b239a92dda4194174b9a
SimHash 49154ce0c551

Groups

blexbot

Rule Path
Disallow /

linguee

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

googlebot-news

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /process

Other Records

Field Value
sitemap https://www.topmags.com/sitemap.xml
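
To illustrate how the groups and the sitemap record above might be applied by a crawler, here is a minimal sketch using Python's standard urllib.robotparser. The ROBOTS_TXT text is reconstructed from the listing above, so its exact layout and ordering are assumptions, not the file as served. The allowed() helper and the "SomeBot" and "Googlebot" agent names are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan listing; the layout and ordering of the
# real file are assumptions.
ROBOTS_TXT = """\
User-agent: blexbot
Disallow: /

User-agent: linguee
Disallow: /

User-agent: ia_archiver
Disallow: /

User-agent: googlebot-news
Disallow: /

User-agent: *
Disallow: /cgi-bin/
Disallow: /process

Sitemap: https://www.topmags.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on www.topmags.com?"""
    return parser.can_fetch(agent, "https://www.topmags.com" + path)


if __name__ == "__main__":
    # Named groups are blocked from the whole site.
    print("BLEXBot /          ->", allowed("BLEXBot", "/"))
    print("Googlebot-News /a  ->", allowed("Googlebot-News", "/a"))
    # Agents without their own group fall back to the "*" group.
    print("SomeBot /          ->", allowed("SomeBot", "/"))
    print("SomeBot /cgi-bin/x ->", allowed("SomeBot", "/cgi-bin/x"))
    print("SomeBot /process   ->", allowed("SomeBot", "/process"))
    # Sitemap records are exposed separately from the rule groups.
    print("Sitemaps:", parser.site_maps())
```

This parser matches Disallow paths by prefix, so under this sketch /process also covers longer paths that begin with the same characters. Other crawlers may implement matching differently.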