pakistanpressfoundation.org
robots.txt

Robots Exclusion Standard data for pakistanpressfoundation.org

Resource Scan

Scan Details

Site Domain pakistanpressfoundation.org
Base Domain pakistanpressfoundation.org
Scan Status Ok
Last Scan2024-10-30T18:51:44+00:00
Next Scan 2024-11-06T18:51:44+00:00

Last Scan

Scanned2024-10-30T18:51:44+00:00
URL https://pakistanpressfoundation.org/robots.txt
Domain IPs 160.153.0.67
Response IP 160.153.0.67
Found Yes
Hash 3bca410c1e64505d27868fadcbf15a39e47b2638e6cb33bda0e37e720ff3082b
SimHash 67344d526e0d

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-content/
Disallow /wp-includes/
Disallow /go/
Disallow /comments/feed/
Disallow /trackback/
Disallow /index.php
Disallow /xmlrpc.php
Disallow /search?
Disallow /?p=*

mediapartners-google*

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads/

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5200

marketwirebot

Rule Path
Disallow /

femtosearchbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

gnowitnewsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

scooperbot

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

femtosearchbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

Other Records

Field Value
sitemap http://pakistanpressfoundation.org/sitemap.xml.gz

Warnings

  • 1 invalid line.