paradisecoorg.com
robots.txt

Robots Exclusion Standard data for paradisecoorg.com

Resource Scan

Scan Details

Site Domain paradisecoorg.com
Base Domain paradisecoorg.com
Scan Status Ok
Last Scan2025-12-01T03:03:52+00:00
Next Scan 2025-12-08T03:03:52+00:00

Last Scan

Scanned2025-12-01T03:03:52+00:00
URL https://paradisecoorg.com/robots.txt
Domain IPs 103.102.234.44
Response IP 103.102.234.44
Found Yes
Hash 38ac66814dd3599b9c1cded7f1118c690fdc6726c904a0c3ff18bf12bf438b3c
SimHash 6484181326f7

Groups

*

Rule Path
Allow /
Disallow /temp/
Disallow /ref/
Disallow /hero/
Disallow /intro/
Disallow /banner/
Disallow /css/
Disallow /js/
Disallow /*.json$
Disallow /*.md$
Disallow /*.zip$
Allow /images/
Allow /banner/

Other Records

Field Value
crawl-delay 1

googlebot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

facebookexternalhit

Rule Path
Allow /

twitterbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://paradisecoorg.com/sitemap.xml

Comments

  • Sitemap location
  • Specific crawl delays for better server performance
  • Block certain files and directories
  • Allow important directories
  • Specific bot instructions