Robots Exclusion Standard data for newsgear.pk

Resource Scan

Scan Details

Site Domain newsgear.pk
Base Domain newsgear.pk
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a server error.
Last Scan 2024-09-10T00:19:45+00:00
Next Scan 2024-12-09T00:19:45+00:00

Last Successful Scan

Scanned 2024-05-13T14:54:25+00:00
URL https://newsgear.pk/robots.txt
Domain IPs 104.21.94.50, 172.67.219.165, 2606:4700:3033::ac43:dba5, 2606:4700:3035::6815:5e32
Response IP 172.67.219.165
Found Yes
Hash 1f23344071528d588c99468d270e991427b7235b3d27272593e9c039f2caadc9
SimHash 1036d4c2ec1a

Groups

*

Rule Path
Disallow /wp-json/
Disallow /?rest_route=

*

Rule Path
Disallow /?s=
Disallow /search/

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

Comments

  • Block Ahrefs Crawler
  • Block Semrush Crawler
  • Block Moz Crawler
  • Block Majestic Crawler
  • Block archive.org bots
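
The groups listed above can be read as a small lookup table. The following is a minimal sketch, not the scanner's own logic, of how a crawler might decide whether a path is allowed under these rules. It assumes the two `*` groups are merged into one (RFC 9309 asks crawlers to combine groups that match the same user agent), that the user-agent token is matched exactly and case-insensitively, and that plain prefix matching is enough because the file contains only Disallow rules without wildcards. The names `GROUPS` and `is_allowed` are hypothetical.

```python
# Disallow rules transcribed from the last successful scan (2024-05-13).
# Assumption: the two "*" groups in the report are merged into one entry.
GROUPS = {
    "*": ["/wp-json/", "/?rest_route=", "/?s=", "/search/"],
    "ahrefsbot": ["/"],
    "semrushbot": ["/"],
    "rogerbot": ["/"],
    "mj12bot": ["/"],
    "ia_archiver": ["/"],
    "archive.org_bot": ["/"],
    "ia_archiver-web.archive.org": ["/"],
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if `path` is not covered by a Disallow rule for `user_agent`.

    `user_agent` is the crawler's product token (for example "AhrefsBot").
    `path` is the URL path including any query string (for example "/?s=x").
    Agents without their own group fall back to the "*" group.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    # Simplification: prefix match only; no Allow rules or wildcards are present.
    return not any(path.startswith(prefix) for prefix in rules)


if __name__ == "__main__":
    for agent, path in [
        ("Googlebot", "/news/"),
        ("Googlebot", "/wp-json/wp/v2/posts"),
        ("AhrefsBot", "/news/"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, a crawler with no dedicated group is kept out of the WordPress REST API and search paths only, while the listed SEO and archive.org crawlers are blocked from the whole site.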