Deliverator@kbin.social · 2 years agoMachine Learning Beginner Info/Resourcesplus-squarepinmessage-squaremessage-square3linkfedilinkarrow-up126arrow-down10
arrow-up126arrow-down1message-squareMachine Learning Beginner Info/Resourcesplus-squarepinDeliverator@kbin.social · 2 years agomessage-square3linkfedilink
KingRandomGuyEnglish · 2 years ago[R] Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model arxiv.orgexternal-linkmessage-square0linkfedilinkarrow-up12arrow-down10
arrow-up12arrow-down1external-link[R] Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model arxiv.orgKingRandomGuyEnglish · 2 years agomessage-square0linkfedilink
nsa@kbin.social · 2 years agoWhat's In My Big Data?plus-squarearxiv.orgexternal-linkmessage-square0linkfedilinkarrow-up16arrow-down10
arrow-up16arrow-down1external-linkWhat's In My Big Data?plus-squarearxiv.orgnsa@kbin.social · 2 years agomessage-square0linkfedilink
nsa@kbin.social · 2 years agoThe Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AIplus-squarearxiv.orgexternal-linkmessage-square0linkfedilinkarrow-up13arrow-down10
arrow-up13arrow-down1external-linkThe Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AIplus-squarearxiv.orgnsa@kbin.social · 2 years agomessage-square0linkfedilink
KingsmanVince@kbin.social · 2 years agoMM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision and Language Models & Tasksplus-squareaclanthology.orgexternal-linkmessage-square0linkfedilinkarrow-up14arrow-down10
arrow-up14arrow-down1external-linkMM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision and Language Models & Tasksplus-squareaclanthology.orgKingsmanVince@kbin.social · 2 years agomessage-square0linkfedilink
KingsmanVince@kbin.social · 2 years agoDemystifying CLIP Dataplus-squarearxiv.orgexternal-linkmessage-square0linkfedilinkarrow-up14arrow-down10
arrow-up14arrow-down1external-linkDemystifying CLIP Dataplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square0linkfedilink
nsa@kbin.social · 2 years agoGPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problemsplus-squarearxiv.orgexternal-linkmessage-square0linkfedilinkarrow-up12arrow-down10
arrow-up12arrow-down1external-linkGPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problemsplus-squarearxiv.orgnsa@kbin.social · 2 years agomessage-square0linkfedilink
KingsmanVince@kbin.social · 2 years agoPaLI-3 Vision Language Models: Smaller, Faster, Strongerplus-squarearxiv.orgexternal-linkmessage-square3linkfedilinkarrow-up14arrow-down10
arrow-up14arrow-down1external-linkPaLI-3 Vision Language Models: Smaller, Faster, Strongerplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square3linkfedilink
KingsmanVince@kbin.social · 2 years agoMiniGPT-v2: large language model as a unified interface for vision-language multi-task learningplus-squarearxiv.orgexternal-linkmessage-square0linkfedilinkarrow-up16arrow-down10
arrow-up16arrow-down1external-linkMiniGPT-v2: large language model as a unified interface for vision-language multi-task learningplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square0linkfedilink
KingsmanVince@kbin.social · 2 years agoFinetune Like You Pretrain: Improved Finetuning of Zero-Shot Vision Modelsplus-squareopenaccess.thecvf.comexternal-linkmessage-square0linkfedilinkarrow-up14arrow-down10
arrow-up14arrow-down1external-linkFinetune Like You Pretrain: Improved Finetuning of Zero-Shot Vision Modelsplus-squareopenaccess.thecvf.comKingsmanVince@kbin.social · 2 years agomessage-square0linkfedilink
nsa@kbin.social · 2 years agoA Long Way to Go: Investigating Length Correlations in RLHFplus-squarearxiv.orgexternal-linkmessage-square0linkfedilinkarrow-up12arrow-down10
arrow-up12arrow-down1external-linkA Long Way to Go: Investigating Length Correlations in RLHFplus-squarearxiv.orgnsa@kbin.social · 2 years agomessage-square0linkfedilink
nsa@kbin.social · 2 years agoThink before you speak: Training Language Models With Pause Tokensplus-squarearxiv.orgexternal-linkmessage-square1linkfedilinkarrow-up16arrow-down10
arrow-up16arrow-down1external-linkThink before you speak: Training Language Models With Pause Tokensplus-squarearxiv.orgnsa@kbin.social · 2 years agomessage-square1linkfedilink
KingsmanVince@kbin.social · 2 years agoCLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say Noplus-squarearxiv.orgexternal-linkmessage-square0linkfedilinkarrow-up15arrow-down10
arrow-up15arrow-down1external-linkCLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say Noplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square0linkfedilink
nsa@kbin.social · 2 years agoLanguage Modeling Is Compressionplus-squarearxiv.orgexternal-linkmessage-square7linkfedilinkarrow-up15arrow-down10
arrow-up15arrow-down1external-linkLanguage Modeling Is Compressionplus-squarearxiv.orgnsa@kbin.social · 2 years agomessage-square7linkfedilink
KingsmanVince@kbin.social · 2 years agoScaling Vision-Language Models with Sparse Mixture of Expertsplus-squarearxiv.orgexternal-linkmessage-square0linkfedilinkarrow-up14arrow-down10
arrow-up14arrow-down1external-linkScaling Vision-Language Models with Sparse Mixture of Expertsplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square0linkfedilink
KingsmanVince@kbin.social · 2 years agoHydra-MoE: A new class of Open-Source Mixture of Expertsplus-squaregithub.comexternal-linkmessage-square0linkfedilinkarrow-up17arrow-down10
arrow-up17arrow-down1external-linkHydra-MoE: A new class of Open-Source Mixture of Expertsplus-squaregithub.comKingsmanVince@kbin.social · 2 years agomessage-square0linkfedilink
KingsmanVince@kbin.social · 2 years agoBridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasksplus-squarearxiv.orgexternal-linkmessage-square0linkfedilinkarrow-up13arrow-down10
arrow-up13arrow-down1external-linkBridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasksplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square0linkfedilink
KingsmanVince@kbin.social · 2 years agoFoundational Models Defining a New Era in Vision: A Survey and Outlookplus-squarearxiv.orgexternal-linkmessage-square0linkfedilinkarrow-up13arrow-down10
arrow-up13arrow-down1external-linkFoundational Models Defining a New Era in Vision: A Survey and Outlookplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square0linkfedilink
KingsmanVince@kbin.social · 2 years agoUnifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Language Pre-trainingplus-squareaclanthology.orgexternal-linkmessage-square1linkfedilinkarrow-up16arrow-down11
arrow-up15arrow-down1external-linkUnifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Language Pre-trainingplus-squareaclanthology.orgKingsmanVince@kbin.social · 2 years agomessage-square1linkfedilink
KingsmanVince@kbin.social · 2 years agoMaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasksplus-squarearxiv.orgexternal-linkmessage-square1linkfedilinkarrow-up13arrow-down10
arrow-up13arrow-down1external-linkMaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasksplus-squarearxiv.orgKingsmanVince@kbin.social · 2 years agomessage-square1linkfedilink