trulioo.com
robots.txt

Robots Exclusion Standard data for trulioo.com

Resource Scan

Scan Details

Site Domain trulioo.com
Base Domain trulioo.com
Scan Status Ok
Last Scan2024-09-19T14:22:30+00:00
Next Scan 2024-10-19T14:22:30+00:00

Last Scan

Scanned2024-09-19T14:22:30+00:00
URL https://trulioo.com/robots.txt
Redirect https://www.trulioo.com/robots.txt
Redirect Domain www.trulioo.com
Redirect Base trulioo.com
Domain IPs 141.193.213.20, 141.193.213.21
Redirect IPs 141.193.213.20, 141.193.213.21
Response IP 141.193.213.20
Found Yes
Hash b17d6501fff7cf961a655668da7e145de8ccce952332fbc15043b88a9e7f8cb3
SimHash 6e2058d0a0ff

Groups

*

Rule Path
Disallow /wp-admin/

pinterest

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

facebookbot

Rule Path
Disallow /cdn-cgi/

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://www.trulioo.com/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • Block access to specific groups of pages
  • maximum rate is one page every 10 seconds
  • Block FacebookBot
  • 5 seconds between page requests
  • ---------------------------
  • END YOAST BLOCK
  • CLOUDFLARE