trendinglymail.com
robots.txt

Robots Exclusion Standard data for trendinglymail.com

Resource Scan

Scan Details

Site Domain trendinglymail.com
Base Domain trendinglymail.com
Scan Status Ok
Last Scan2024-10-09T22:48:34+00:00
Next Scan 2024-10-16T22:48:34+00:00

Last Scan

Scanned2024-10-09T22:48:34+00:00
URL https://trendinglymail.com/robots.txt
Redirect https://www.trendinglymail.com/robots.txt
Redirect Domain www.trendinglymail.com
Redirect Base trendinglymail.com
Domain IPs 216.24.57.1
Redirect IPs 216.24.57.252, 216.24.57.4
Response IP 216.24.57.4
Found Yes
Hash 4a82f96c2583b6ffca8a3c5947396da724f05c5cb99dd659d81cd87b6b10377c
SimHash 1a90d20dd7f4

Groups

twitterbot

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

blexbot/1.0

Rule Path
Disallow /

linkdexbot/2.0

Rule Path
Disallow /

linkdexbot/2.1

Rule Path
Disallow /

linkdexbot/2.2

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

*

Rule Path
Allow /$
Disallow /

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • Allow Twitterbot in order to read Twitter Cards
  • Allow Google Mediabot for AdSense/AdX
  • Bots
  • Other