samaa.tv
robots.txt

Robots Exclusion Standard data for samaa.tv

Resource Scan

Scan Details

Site Domain samaa.tv
Base Domain samaa.tv
Scan Status Ok
Last Scan 2024-09-15T21:32:28+00:00
Next Scan 2024-09-22T21:32:28+00:00

Last Scan

Scanned 2024-09-15T21:32:28+00:00
URL https://samaa.tv/robots.txt
Redirect https://www.samaa.tv/robots.txt
Redirect Domain www.samaa.tv
Redirect Base samaa.tv
Domain IPs 104.21.63.234, 172.67.173.23, 2606:4700:3031::6815:3fea, 2606:4700:3035::ac43:ad17
Redirect IPs 104.21.63.234, 172.67.173.23, 2606:4700:3031::6815:3fea, 2606:4700:3035::ac43:ad17
Response IP 172.67.173.23
Found Yes
Hash 33fd9c8d83a573859ff7bb66d299f2bac8c958bad09f3c40be873f91f97f4aee
SimHash a01ed731e783
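
The scan above records a redirect from samaa.tv to www.samaa.tv, with the same base domain on both sides. A minimal sketch of how such a check could be made is shown below. The helper name same_base_domain is hypothetical, and the suffix comparison is a simplification that assumes the base domain is already known; it does not consult a public suffix list and is not necessarily how the scanner works.

```python
from urllib.parse import urlsplit


def same_base_domain(url: str, base_domain: str) -> bool:
    """Return True if the URL's host is the base domain or a subdomain of it.

    Hypothetical helper: a plain suffix comparison, not a public-suffix lookup.
    """
    host = (urlsplit(url).hostname or "").lower()
    base = base_domain.lower()
    return host == base or host.endswith("." + base)


# Values taken from the scan details above.
redirect_url = "https://www.samaa.tv/robots.txt"
print(same_base_domain(redirect_url, "samaa.tv"))
```

Comparing against "." + base rather than the bare base avoids treating an unrelated host such as notsamaa.tv as a match.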

Groups

dotbot

Rule Path
Allow /

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

slurp

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

facebookexternalhit/

Rule Path
Allow /

linkedinbot
amazonbot

Rule Path
Allow /

twitterbot

Rule Path
Allow /

pinterestbot

Rule Path
Allow /

applebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /
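
The groups listed above can be checked programmatically. The sketch below uses Python's standard urllib.robotparser on a partial reconstruction of the file. The ROBOTS_TXT text is an assumption built from the groups in this report; the original file's exact wording and ordering are not shown here.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of a few of the groups listed in this report.
# Two consecutive User-agent lines share the rules that follow them,
# which mirrors the linkedinbot/amazonbot group above.
ROBOTS_TXT = """\
User-agent: googlebot
Allow: /

User-agent: linkedinbot
User-agent: amazonbot
Allow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Ask whether each agent may fetch the site root.
for agent in ("googlebot", "linkedinbot", "amazonbot"):
    print(agent, parser.can_fetch(agent, "https://www.samaa.tv/"))
```

Since every group in the report carries only Allow /, each listed agent is expected to be permitted on any path.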

Other Records

Field Value
sitemap https://www.samaa.tv/sitemap.xml
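
The sitemap record above can be read with the same standard-library parser. This is a small sketch, assuming Python 3.8 or later (where RobotFileParser.site_maps() is available) and assuming the record appears in the file as a single Sitemap line; the variable names are illustrative.

```python
from urllib.robotparser import RobotFileParser

# Assumed form of the sitemap record as it would appear in robots.txt.
lines = ["Sitemap: https://www.samaa.tv/sitemap.xml"]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(lines)

# site_maps() returns a list of sitemap URLs, or None if there are none.
sitemaps = sitemap_parser.site_maps()
print(sitemaps)
```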

Comments

  • Allow DotBot
  • Allow major search engines full access
  • Allow social media bots
  • Allow Applebot
  • Allow Google's specialized bots
  • Specify the location of the sitemap