presearch.org
robots.txt

Robots Exclusion Standard data for presearch.org

Resource Scan

Scan Details

Site Domain presearch.org
Base Domain presearch.org
Scan Status Ok
Last Scan 2024-11-12T23:15:14+00:00
Next Scan 2024-11-19T23:15:14+00:00

Last Scan

Scanned 2024-11-12T23:15:14+00:00
URL https://presearch.org/robots.txt
Redirect https://presearch.com/robots.txt
Redirect Domain presearch.com
Redirect Base presearch.com
Domain IPs 108.157.254.47, 108.157.254.59, 108.157.254.65, 108.157.254.67
Redirect IPs 13.228.2.94, 13.228.9.230, 13.251.12.62
Response IP 13.228.2.94
Found Yes
Hash a0a7569329d37b3f521207fa7797437487e300bd564466077441f7beea13260b
SimHash 68505c32eac1

Groups

*

Rule Path
Allow /
Disallow /search
Disallow /*?q=
Disallow /images
Disallow /videos
Disallow /news
Disallow /shopping
Disallow /results
Disallow /ai-results
Disallow /background-url/
Disallow /travel
Disallow /map
Disallow /*?ref=
Disallow /*?session_id=
Disallow /*?page=
Disallow /*.pdf$
Disallow /*.jpg$
Disallow /*.png$
Disallow /*.css$
Disallow /*.js$
Disallow /*.json$
Disallow /*.csv$
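
The rules above mix plain prefixes with `*` wildcards and `$` end anchors. The following is a minimal sketch of how a crawler might evaluate them. It assumes the longest-match precedence described in RFC 9309, with Allow winning ties. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and only a subset of the listed rules is included for brevity.

```python
import re

# Subset of the "*" group rules, copied from the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "/search"),
    ("disallow", "/*?q="),
    ("disallow", "/*?page="),
    ("disallow", "/*.pdf$"),
    ("disallow", "/*.js$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, starting at the beginning of the path.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be fetched.

    Assumption: the longest matching pattern wins, and Allow wins a tie.
    A path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


print(is_allowed("/about"))
print(is_allowed("/search?q=example"))
print(is_allowed("/docs/file.pdf"))
```

Under these assumptions, `Allow /` only decides the outcome for paths that match no longer Disallow pattern, because every Disallow pattern in the list is longer than `/`.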

Other Records

Field Value
crawl-delay 20
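
As an illustration of the crawl-delay record, the sketch below reads the value with Python's standard `urllib.robotparser` and spaces out requests accordingly. The `ROBOTS_FRAGMENT` text is a reconstruction from this report, not the fetched file, and `schedule` is a hypothetical helper. Crawl-delay is a non-standard field, and interpreting it as seconds between requests is an assumption that not every crawler shares.

```python
from urllib import robotparser

# Reconstructed from the report above; the real file's layout may differ.
ROBOTS_FRAGMENT = """\
User-agent: *
Crawl-delay: 20
"""

parser = robotparser.RobotFileParser()
parser.parse(ROBOTS_FRAGMENT.splitlines())

# crawl_delay() returns the delay for the matching group, or None if absent.
delay = parser.crawl_delay("*")


def schedule(start, count, delay_seconds):
    """Return the earliest fetch times (in seconds) for `count` requests."""
    return [start + i * delay_seconds for i in range(count)]


print(delay)
print(schedule(0, 3, delay))
```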

Other Records

Field Value
sitemap https://www.presearch.com/sitemap.xml
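
The sitemap record can be read the same way. This is a small sketch using `RobotFileParser.site_maps()` (available in Python 3.8 and later). The one-line input is reconstructed from this report rather than taken from the fetched file.

```python
from urllib import robotparser
from urllib.parse import urlsplit

# Reconstructed from the sitemap record above.
ROBOTS_FRAGMENT = "Sitemap: https://www.presearch.com/sitemap.xml\n"

parser = robotparser.RobotFileParser()
parser.parse(ROBOTS_FRAGMENT.splitlines())

# site_maps() returns a list of sitemap URLs, or None when none are declared.
sitemaps = parser.site_maps() or []
hosts = [urlsplit(url).hostname for url in sitemaps]

print(sitemaps)
print(hosts)
```

Note that the sitemap host, `www.presearch.com`, differs from both the scanned domain (`presearch.org`) and the redirect domain (`presearch.com`), so a crawler that restricts itself to one host would need to account for that.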

Comments

  • Block unnecessary parameters
  • Block certain file types
  • Sitemap