Articles for category: AI Research

[2503.12808] Estimating stationary mass, frequency by frequency

[Submitted on 17 Mar 2025 (v1), last revised 18 Mar 2025 (this version, v2)] Estimating stationary mass, frequency by frequency, by Milind Nakul and 2 other authors. Abstract: Suppose we observe a trajectory of length $n$ from an $\alpha$-mixing stochastic process over a finite but potentially large state space. We consider the problem of estimating the probability mass placed by the stationary distribution of any such process on elements that occur with a certain frequency in the observed sequence. We estimate this vector of probabilities in total variation distance, showing

[2412.08116] Diffusion-based Data Augmentation and Knowledge Distillation with Generated Soft Labels Solving Data Scarcity Problems of SAR Oil Spill Segmentation

[Submitted on 11 Dec 2024 (v1), last revised 18 Mar 2025 (this version, v2)] Diffusion-based Data Augmentation and Knowledge Distillation with Generated Soft Labels Solving Data Scarcity Problems of SAR Oil Spill Segmentation, by Jaeho Moon and 4 other authors. Abstract: Oil spills pose severe environmental risks, making early detection crucial for effective response and mitigation. As Synthetic Aperture Radar (SAR) images operate under all-weather conditions, SAR-based oil spill segmentation enables fast and robust monitoring. However, when using deep learning models, SAR oil spill segmentation often struggles in training due

Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs

[Submitted on 7 Feb 2025 (v1), last revised 18 Mar 2025 (this version, v2)] Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs, by Rohit Saxena and 2 other authors. Abstract: Understanding time from visual representations is a fundamental cognitive skill, yet it remains a challenge for multimodal large language models (MLLMs). In this work, we investigate the capabilities of MLLMs in interpreting time and date through analogue clocks and yearly calendars. To facilitate this, we curated a structured dataset comprising two subsets: 1) $\textit{ClockQA}$, which comprises various

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

[Submitted on 12 Mar 2025 (v1), last revised 18 Mar 2025 (this version, v2)] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models, by Marianne Arriola and 7 other authors. Abstract: Diffusion language models offer unique benefits over autoregressive models due to their potential for parallelized generation and controllability, yet they lag in likelihood modeling and are limited to fixed-length generation. In this work, we introduce a class of block diffusion language models that interpolate between discrete denoising diffusion and autoregressive models. Block diffusion overcomes key limitations of both approaches by supporting

Weekly Entering & Transitioning – Thread 20 Jan, 2025 – 27 Jan, 2025

Welcome to this week's entering & transitioning thread! This thread is for any questions about getting started, studying, or transitioning into the data science field. Topics include: learning resources (e.g. books, tutorials, videos); traditional education (e.g. schools, degrees, electives); alternative education (e.g. online courses, bootcamps); job search questions (e.g. resumes, applying, career prospects); and elementary questions (e.g. where to start, what next). While you wait for answers from the community, check out the FAQ and Resources pages on our wiki. You can also search for answers in past weekly threads. submitted by /u/AutoModerator

One-Minute Daily AI News 3/17/2025

Japan lacks workers to care for the elderly. This company is using AI to help.[1]
Mistral AI drops new open-source model that outperforms GPT-4o Mini with fraction of parameters.[2]
Amazon’s AI-enhanced Alexa assistant is going to need all your voice recordings, and there’s nothing you can do about it.[3]
Marin County oyster business using AI to help run company.[4]

Sources:
[1] https://www.cnbc.com/2025/03/18/how-ai-can-help-care-for-elderly-people-a-company-in-japan-explains.html
[2] https://venturebeat.com/ai/mistral-ai-drops-new-open-source-model-that-outperforms-gpt-4o-mini-with-fraction-of-parameters/
[3] https://gizmodo.com/amazon-will-listen-to-all-your-voice-recordings-if-you-use-alexa-2000576755
[4] https://www.cbsnews.com/sanfrancisco/video/marin-county-oyster-business-using-ai-to-help-run-company/

submitted by /u/Excellent-Target-847

Retrieve most asked questions in chatbot

Hi, I have a simple chatbot application, and I want to add functionality to display and choose from the most asked questions in the last x days. I want to implement semantic search and store those questions in a vector database. Is there any solution or tool (including paid services) that will help me retrieve the top n asked questions in one call? I'm afraid that if I check similarity for every question, and each question has to be compared to every other question, this will degrade performance. Of course I can optimize it and pregenerate the results with some job, but I'm afraid how this will work on
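One way to picture the pregenerated job mentioned in the question is a batch pass that groups near-duplicate questions by embedding similarity and counts each group. The sketch below is only an illustration under stated assumptions: the function name `top_asked`, the 0.9 threshold, and the 3-dimensional toy vectors are all hypothetical stand-ins, and a real setup would use embeddings from an actual model and would probably replace the inner loop with the vector database's own nearest-neighbor search.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_asked(questions, embeddings, n=3, threshold=0.9):
    """Greedily group similar questions and return the n largest groups.

    Each question joins the first group whose representative (the first
    question placed in that group) is at least `threshold` similar;
    otherwise it starts a new group. The threshold is an assumed value.
    """
    groups = []
    for text, emb in zip(questions, embeddings):
        for group in groups:
            if cosine(emb, group["embedding"]) >= threshold:
                group["count"] += 1
                break
        else:
            # No existing group was close enough: start a new one.
            groups.append({"text": text, "embedding": emb, "count": 1})
    # sorted() is stable, so ties keep first-seen order.
    ranked = sorted(groups, key=lambda g: g["count"], reverse=True)
    return [(g["text"], g["count"]) for g in ranked[:n]]


# Toy data: hand-made vectors standing in for real embeddings.
SAMPLE_QUESTIONS = [
    "How do I reset my password?",
    "I forgot my password, how do I reset it?",
    "What are your opening hours?",
    "Password reset steps?",
    "When are you open?",
    "How do I cancel my order?",
]
SAMPLE_EMBEDDINGS = [
    [1.0, 0.0, 0.0],
    [0.98, 0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.95, 0.05, 0.1],
    [0.05, 0.99, 0.0],
    [0.0, 0.0, 1.0],
]

if __name__ == "__main__":
    for text, count in top_asked(SAMPLE_QUESTIONS, SAMPLE_EMBEDDINGS, n=2):
        print(count, text)
```

Because each question is compared only against group representatives, not against every other question, the cost grows with the number of distinct topics instead of with the square of the number of questions. The trade-off is that greedy grouping depends on the order in which questions arrive.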

[R] Compute Sponsorships/Grants

Does anyone know of any companies that are providing free/discounted compute, grants, or sponsorships for people wanting to work on their own research ideas? For example, I know fal.ai has a Research Grant program, and so does Google. Curious if people know of any others. submitted by /u/coronary-service