Because I am curious about what different AIs can do, I have been looking around at some of the lesser-discussed ones (in the mainstream media, at least) and ended up trying Kimi, a Chinese-based open-source service that seems to be getting favourable comments online from the tech people who make a living talking about AI.
I am only on a free trial at the moment, and I am not even sure whether I can keep using it for free after that. (It is, though, one of the cheapest models to pay for, if I am so inclined.)
Yesterday I tested it by asking it to describe a photo, as it seems to sell itself as being very good at dealing with visual stuff.
I uploaded this photo (which appeared in a recent post):
and got this description:
This is a beautiful winter photograph of a traditional Japanese temple or shrine blanketed in snow.
Main Subject
The central focus is a two-story wooden temple building with classic Japanese architectural features:
Curved, tiled roofs with upturned eaves (karahafu style)
Dark wooden beams and structure with intricate joinery visible under the eaves
Snow-covered roof creating a striking contrast against the dark wood
An open entrance on the ground floor revealing a dark interior with what appears to be a small orange/red offering or light inside
Setting & Atmosphere
Snow-covered stone pathway leads directly to the temple entrance, creating strong leading lines and symmetry
Deep snow banks line both sides of the pathway
Stone lanterns (tōrō) flank the entrance, also dusted with snow
Snow-laden trees visible in the background, suggesting a forested or garden setting
Overcast, pale sky giving the scene a serene, muted atmosphere
Composition
The photo uses central symmetry with the pathway drawing the eye straight to the temple entrance. The cool color palette of whites, grays, and dark browns creates a peaceful, contemplative winter mood. The wet stone path suggests recent snowfall or melting, adding texture and subtle reflections.
This appears to be a well-known temple, possibly in Kyoto or another historic Japanese city, captured during a quiet winter moment.
I don't know. If it had just said "looks like a temple, probably in Japan", it wouldn't have struck me as anything too special. But the detail in that description is what freaks me out, a little...
(I subsequently talked to it about the exact name of the temple, and asked whether it could train itself on our conversations, so that it might recognize the specific temple if another person ever uploaded a photo of it. It said no: it doesn't train on conversations, and doesn't remember them unless I ask it to within my own account. So each conversation is "fresh". I guess that's a good thing?)