TriVLLo: Tri-View Dynamic Architecture and Unified Cross-Modal Representation for Efficient Fine-Grained Vision–Language Understanding
Liang Kou, Wenlong Fan, Xingru Huang, Bai Lin, Bo Yang, Jilin Zhang, Yun Lin, Meiyu Wang
IEEE Transactions on Reliability, Vol. 75, No. 1