tripoto.com
robots.txt

Robots Exclusion Standard data for tripoto.com

Resource Scan

Scan Details

Site Domain tripoto.com
Base Domain tripoto.com
Scan Status Ok
Last Scan2024-11-18T16:47:23+00:00
Next Scan 2024-11-25T16:47:23+00:00

Last Scan

Scanned2024-11-18T16:47:23+00:00
URL https://tripoto.com/robots.txt
Redirect https://www.tripoto.com/robots.txt
Redirect Domain www.tripoto.com
Redirect Base tripoto.com
Domain IPs 34.107.249.112
Redirect IPs 34.107.249.112
Response IP 34.107.249.112
Found Yes
Hash d7ff4eb4125fc5a517986ad75c660c8344a22eccf3a696b71f4a132253ac742b
SimHash 5a1ed8424ab1

Groups

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

rogerbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

blexbot

Rule Path
Disallow /

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3

semrushbot-sa

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3

mj12bot

Rule Path
Disallow /

ccbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /common_auth/
Disallow /generic_auth/
Disallow /trips/createNew
Disallow /amp/login
Disallow /photos/
Disallow /ApiSearchResults/
Disallow /users/register
Disallow /contests/photo
Disallow /url/
Disallow /dashboards/
Disallow /verify/
Disallow /channel/
Disallow /custom-pages/
Disallow /business
Disallow /business/
Disallow /profile-new
Allow /ads.txt

Other Records

Field Value
sitemap https://www.tripoto.com/sitemap-index.xml