emiratesnewsflash.com
robots.txt

Robots Exclusion Standard data for emiratesnewsflash.com

Resource Scan

Scan Details

Site Domain emiratesnewsflash.com
Base Domain emiratesnewsflash.com
Scan Status Ok
Last Scan2025-09-27T03:49:15+00:00
Next Scan 2025-10-04T03:49:15+00:00

Last Scan

Scanned2025-09-27T03:49:15+00:00
URL https://emiratesnewsflash.com/robots.txt
Domain IPs 104.21.74.79, 172.67.200.118, 2606:4700:3033::6815:4a4f, 2606:4700:3035::ac43:c876
Response IP 172.67.200.118
Found Yes
Hash a26666c3907ccc9157a6edc8a7e101710c0c898ae24d37c12e4dfb3601a4a750
SimHash 63344952ee0d

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-content/
Disallow /wp-includes/
Disallow /go/
Disallow /comments/feed/
Disallow /trackback/
Disallow /index.php
Disallow /xmlrpc.php
Disallow /search?
Disallow /?p=*
Disallow /products/

mediapartners-google*

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads/

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5200

marketwirebot

Rule Path
Disallow /

femtosearchbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

gnowitnewsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

scooperbot

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

femtosearchbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

Other Records

Field Value
sitemap http://emiratespressreleases.net/sitemap.xml.gz

Warnings

  • 1 invalid line.