nicepredict.com
robots.txt

Robots Exclusion Standard data for nicepredict.com

Resource Scan

Scan Details

Site Domain nicepredict.com
Base Domain nicepredict.com
Scan Status Ok
Last Scan2025-09-25T13:08:36+00:00
Next Scan 2025-10-02T13:08:36+00:00

Last Scan

Scanned2025-09-25T13:08:36+00:00
URL https://nicepredict.com/robots.txt
Domain IPs 104.21.20.76, 172.67.191.227, 2606:4700:3031::6815:144c, 2606:4700:3033::ac43:bfe3
Response IP 104.21.20.76
Found Yes
Hash dc2330d6f7a81d7ff301b8b343d608c160796a57b073d5817df947e8ae064257
SimHash 2c1d111606b0

Groups

*

Rule Path
Disallow
Disallow /admin/
Disallow /login/
Disallow /private/
Disallow /tmp/
Disallow /*?type=
Disallow /*?page=
Disallow /*?dt=
Disallow /*?date=
Disallow /*index.php

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

badbot

Rule Path
Disallow /
Disallow /*?sessionid=
Disallow /*?sort=

*

Rule Path
Allow /*.css$
Allow /*.js$
Allow /*.jpg$
Allow /*.png$
Allow /*.gif$
Allow /*.svg$
Allow /*.xml$
Allow /*.json$
Allow /*.txt$

Other Records

Field Value
sitemap https://nicepredict.com/sitemap.xml

Comments

  • Disallow specific directories
  • Disallow URLs with specific query parameters
  • Disallow URLs with index.php in the path
  • Specific user-agent rules
  • Disallow sessionid and sort query parameters
  • Allow specific file types