I would like to do text-image search. Does AIMv2 have a text encoder like CLIP and SigLIP(2) do? Thanks a lot.