[![Contributors][contributors-shield]][contributors-url] [![Forks][forks-shield]][forks-url] [![Stargazers][stars-shield]][stars-url] [![Issues][issues-shield]][issues-url]
Usage instructions: here
Table of Contents
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-01-03 | Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models | Manh Duong Nguyen et.al. | 2501.01932 | null |
2025-01-03 | Lyman-alpha resonant-line radiative transfer in expanding media | Aaron Smith et.al. | 2501.01928 | null |
2025-01-03 | Stochastic Thermodynamics of the Two-Dimensional Model of Transistors | Jiayin Gu et.al. | 2501.01919 | null |
2025-01-03 | Global existence for multi-dimensional partially diffusive systems | Jean-Paul Adogbo et.al. | 2501.01839 | null |
2025-01-03 | Ingredients: Blending Custom Photos with Video Diffusion Transformers | Zhengcong Fei et.al. | 2501.01790 | null |
2025-01-03 | Nonparametric estimation of a factorizable density using diffusion models | Hyeok Kyu Kwon et.al. | 2501.01783 | null |
2025-01-03 | Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models | Andrea Matteazzi et.al. | 2501.01761 | null |
2025-01-03 | Can structure influence hydrovoltaic energy generation? Insights from the metallic 1T' and semiconducting 2H phases of MoS |
Kaushik Suvigya et.al. | 2501.01739 | null |
2025-01-03 | Innate behavioural mechanisms and defensive traits in ecological models of predator-prey types | Sangeeta Saha et.al. | 2501.01687 | null |
2025-01-03 | A BDDC method for three-dimensional advection-diffusion problems with an adaptive coarse space | Jie Peng et.al. | 2501.01676 | null |
2025-01-02 | VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control | Yuanpeng Tu et.al. | 2501.01427 | null |
2025-01-02 | Object-level Visual Prompts for Compositional Image Generation | Gaurav Parmar et.al. | 2501.01424 | null |
2025-01-02 | Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models | Jingfeng Yao et.al. | 2501.01423 | link |
2025-01-02 | The Bayesian Global Sky Model (B-GSM): Validation of a Data Driven Bayesian Simultaneous Component Separation and Calibration Algorithm for EoR Foreground Modelling | George Carter et.al. | 2501.01417 | null |
2025-01-02 | Test-time Controllable Image Generation by Explicit Spatial Constraint Enforcement | Z. Zhang et.al. | 2501.01368 | null |
2025-01-02 | SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration | Jianyi Wang et.al. | 2501.01320 | null |
2025-01-02 | Cutoff for non-negatively curved diffusions | Justin Salez et.al. | 2501.01304 | null |
2025-01-02 | Self-diffusive dynamics of active Brownian particles at moderate densities | Rodrigo Soto et.al. | 2501.01251 | null |
2025-01-02 | SVFR: A Unified Framework for Generalized Video Face Restoration | Zhiyao Wang et.al. | 2501.01235 | link |
2025-01-02 | Conditional Consistency Guided Image Translation and Enhancement | A. V. Subramanyam et.al. | 2501.01223 | link |
2024-12-30 | Sparse chaos in cortical circuits | Rainer Engelken et.al. | 2412.21188 | null |
2024-12-30 | Downscaling of non van der Waals Semimetallic W5N6 with Resistivity Preservation | Hongze Gao et.al. | 2412.21184 | null |
2024-12-30 | Perfect stationary solutions of reaction-diffusion equations on lattices and regular graphs | Vladimír Švígler et.al. | 2412.21168 | null |
2024-12-30 | Systematic Benchmarking of Macrosegregation: The Performance of a Modified Hybrid Model | Ali Moeinirad et.al. | 2412.21143 | null |
2025-01-02 | Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation | Yuanbo Yang et.al. | 2412.21117 | null |
2024-12-30 | Positional information trade-offs in boundary-driven reaction-diffusion systems | Jonas Berx et.al. | 2412.21113 | null |
2024-12-30 | Lyapunov-Based Deep Neural Networks for Adaptive Control of Stochastic Nonlinear Systems | Saiedeh Akbari et.al. | 2412.21095 | null |
2024-12-30 | Co-diffusion of hydrogen and oxygen for dense oxyhydride synthesis | Masaya Fujioka et.al. | 2412.21086 | null |
2024-12-30 | Quantum Diffusion Model for Quark and Gluon Jet Generation | Mariia Baidachna et.al. | 2412.21082 | link |
2025-01-02 | Edicho: Consistent Image Editing in the Wild | Qingyan Bai et.al. | 2412.21079 | link |
2024-12-27 | Periodically and aperiodically Thue-Morse driven long-range systems: from dynamical localization to slow dynamics | Vatsana Tiwari et.al. | 2412.19736 | null |
2024-12-27 | A coupled mathematical and numerical model for protein spreading and tissue atrophy, applied to Alzheimer's disease | Valentina Pederzoli et.al. | 2412.19661 | null |
2024-12-27 | VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models | Tao Wu et.al. | 2412.19645 | null |
2024-12-27 | Stochastic resetting in a nonequilibrium environment | Koushik Goswami et.al. | 2412.19564 | null |
2024-12-27 | Explicit propagation reversal bounds for bistable differential equations on trees | Petr Stehlík et.al. | 2412.19548 | null |
2024-12-27 | StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture | Miaomiao Dai et.al. | 2412.19535 | null |
2024-12-27 | P3S-Diffusion:A Selective Subject-driven Generation Framework via Point Supervision | Junjie Hu et.al. | 2412.19533 | null |
2024-12-27 | Lévy Score Function and Score-Based Particle Algorithm for Nonlinear Lévy--Fokker--Planck Equations | Yuanfei Huang et.al. | 2412.19520 | null |
2024-12-27 | Dynamical phase transitions in certain non-ergodic stochastic processes | Yogeesh Reddy Yerrababu et.al. | 2412.19516 | null |
2024-12-27 | RobotDiffuse: Motion Planning for Redundant Manipulator based on Diffusion Model | Xiaohan Zhang et.al. | 2412.19500 | link |
2024-12-24 | PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models | Minghao Chen et.al. | 2412.18608 | null |
2024-12-24 | DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers | Yuntao Chen et.al. | 2412.18607 | null |
2024-12-24 | Explaining in Diffusion: Explaining a Classifier Through Hierarchical Semantics with Text-to-Image Diffusion Models | Tahira Kazimi et.al. | 2412.18604 | null |
2024-12-24 | DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation | Minghong Cai et.al. | 2412.18597 | link |
2024-12-24 | LatentCRF: Continuous CRF for Efficient Latent Diffusion | Kanchana Ranasinghe et.al. | 2412.18596 | null |
2024-12-24 | Resolution-Robust 3D MRI Reconstruction with 2D Diffusion Priors: Diverse-Resolution Training Outperforms Interpolation | Anselm Krainovic et.al. | 2412.18584 | null |
2024-12-24 | Relativistic Lévy processes | Lucas G. B. de Souza et.al. | 2412.18581 | null |
2024-12-24 | A mathematical framework for modelling CLMM dynamics in continuous time | Shen-Ning Tung et.al. | 2412.18580 | null |
2024-12-24 | 3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement | Yihang Luo et.al. | 2412.18565 | null |
2024-12-24 | On the fractional approach to quadratic nonlinear parabolic systems | Oscar Jarrin et.al. | 2412.18473 | null |
2024-12-23 | FaceLift: Single Image to 3D Head with View Generation and GS-LRM | Weijie Lyu et.al. | 2412.17812 | null |
2024-12-23 | Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders | Rui Chen et.al. | 2412.17808 | null |
2024-12-23 | Encoding off-shell effects in top pair production in Direct Diffusion networks | Mathias Kuschick et.al. | 2412.17783 | null |
2024-12-23 | PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion | Sophia Tang et.al. | 2412.17780 | null |
2024-12-23 | Thermal Quench Dynamics of Visons in Gapless Kitaev Spin Liquid | Yang Yang et.al. | 2412.17774 | null |
2024-12-23 | The Superposition of Diffusion Models Using the Itô Density Estimator | Marta Skreta et.al. | 2412.17762 | null |
2024-12-23 | Comprehensive Optimization of Interferometric Diffusing Wave Spectroscopy (iDWS) | Mingjun Zhao et.al. | 2412.17724 | null |
2024-12-23 | The Cosmological Population of Gamma-Ray Bursts from the Disks of Active Galactic Nuclei | Hoyoung D. Kang et.al. | 2412.17714 | null |
2024-12-23 | Euclid: Early Release Observations of diffuse stellar structures and globular clusters as probes of the mass assembly of galaxies in the Dorado group | M. Urbano et.al. | 2412.17672 | null |
2024-12-23 | A Bias-Free Training Paradigm for More General AI-generated Image Detection | Fabrizio Guillaro et.al. | 2412.17671 | null |
2024-12-20 | Personalized Representation from Personalized Generation | Shobhita Sundaram et.al. | 2412.16156 | link |
2024-12-20 | Determination of the Magnetic Structure of Spin Glass Compound $\text{Zn}{0.5}\text{Mn}{0.5}\text{Te}$ Using Real-Space Methods | Sabrina R. Hatt et.al. | 2412.16130 | null |
2024-12-20 | Predicting human cooperation: sensitizing drift-diffusion model to interaction and external stimuli | Lucila G. Alvarez-Zuzek et.al. | 2412.16121 | null |
2024-12-20 | CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up | Songhua Liu et.al. | 2412.16112 | link |
2024-12-20 | Differentially Private Federated Learning of Diffusion Models for Synthetic Tabular Data Generation | Timur Sattarov et.al. | 2412.16083 | null |
2024-12-20 | Functional Renormalization Group meets Computational Fluid Dynamics: RG flows in a multi-dimensional field space | Niklas Zorbach et.al. | 2412.16053 | null |
2024-12-20 | Label-Efficient Data Augmentation with Video Diffusion Models for Guidewire Segmentation in Cardiac Fluoroscopy | Shaoyan Pan et.al. | 2412.16050 | null |
2024-12-20 | SafeCFG: Redirecting Harmful Classifier-Free Guidance for Safe Generation | Jiadong Pan et.al. | 2412.16039 | null |
2024-12-20 | Probing lactate exchange and compartmentation in Gray Matter via time-dependent diffusion-weighted MRS | Eloise Mougel et.al. | 2412.16014 | null |
2024-12-20 | Convergence of nonhomogeneous Hawkes processes and Feller random measures | Tristan Pace et.al. | 2412.15999 | null |
2024-12-19 | LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis | Hanlin Wang et.al. | 2412.15214 | link |
2024-12-19 | Flowing from Words to Pixels: A Framework for Cross-Modality Evolution | Qihao Liu et.al. | 2412.15213 | null |
2024-12-19 | Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation | Hadi Alzayer et.al. | 2412.15211 | null |
2024-12-19 | Quantum diffusion and delocalization in one-dimensional band matrices via the flow method | Sofiia Dubova et.al. | 2412.15207 | null |
2024-12-19 | DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation | Wang Zhao et.al. | 2412.15200 | null |
2024-12-19 | AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation | Moayed Haji-Ali et.al. | 2412.15191 | null |
2024-12-19 | LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation | Weijia Shi et.al. | 2412.15188 | null |
2024-12-19 | Tiled Diffusion | Or Madar et.al. | 2412.15185 | null |
2024-12-19 | Option Pricing with a Compound CARMA(p,q)-Hawkes | Lorenzo Mercuri et.al. | 2412.15172 | null |
2024-12-19 | OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization | Jiacheng Zhang et.al. | 2412.15159 | null |
2024-12-18 | AniDoc: Animation Creation Made Easier | Yihao Meng et.al. | 2412.14173 | null |
2024-12-18 | E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling | Zhihang Yuan et.al. | 2412.14170 | null |
2024-12-18 | Autoregressive Video Generation without Vector Quantization | Haoge Deng et.al. | 2412.14169 | link |
2024-12-18 | VideoDPO: Omni-Preference Alignment for Video Diffusion Generation | Runtao Liu et.al. | 2412.14167 | null |
2024-12-18 | AKiRa: Augmentation Kit on Rays for optical video generation | Xi Wang et.al. | 2412.14158 | null |
2024-12-18 | MCMat: Multiview-Consistent and Physically Accurate PBR Material Generation | Shenhao Zhu et.al. | 2412.14148 | null |
2024-12-18 | Measuring collective diffusion properties by counting particles in boxes | Adam Carter et.al. | 2412.14122 | null |
2024-12-18 | SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation | Tong Chen et.al. | 2412.14018 | null |
2024-12-18 | A perturbative approach to the macroscopic fluctuation theory | Thierry Bodineau et.al. | 2412.13991 | null |
2024-12-18 | Double sine-Gordon class of universal coarsening dynamics in a spin-1 Bose gas | Ido Siovitz et.al. | 2412.13986 | null |
2024-12-17 | CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models | Gaoyang Zhang et.al. | 2412.13195 | link |
2024-12-17 | StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models | Yunzhi Yan et.al. | 2412.13188 | null |
2024-12-17 | Move-in-2D: 2D-Conditioned Human Motion Generation | Hsin-Ping Huang et.al. | 2412.13185 | null |
2024-12-17 | A finite volume scheme for the local sensing chemotaxis model | Maxime Herda et.al. | 2412.13143 | null |
2024-12-17 | Symmetries and exact solutions of a reaction-diffusion system arising in population dynamics | Philip Broadbridge et.al. | 2412.13097 | null |
2024-12-17 | Explorando el impacto de los gradientes químicos en los procesos de mezcla del interior estelar | M. M. Ocampo et.al. | 2412.13087 | null |
2024-12-17 | Prompt Augmentation for Self-supervised Text-guided Image Manipulation | Rumeysa Bodur et.al. | 2412.13081 | null |
2024-12-17 | 3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation | Haoshen Wang et.al. | 2412.13059 | null |
2024-12-17 | HCG 57: Evidence for shock-heated intergalactic gas from X-rays and optical emission line spectroscopy | Ewan O'Sullivan et.al. | 2412.13055 | null |
2024-12-17 | Distributed Normal Map-based Stochastic Proximal Gradient Methods over Networks | Kun Huang et.al. | 2412.13054 | null |
2024-12-16 | Causal Diffusion Transformers for Generative Modeling | Chaorui Deng et.al. | 2412.12095 | link |
2024-12-16 | CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models | Felix Taubner et.al. | 2412.12093 | null |
2024-12-16 | Wonderland: Navigating 3D Scenes from a Single Image | Hanwen Liang et.al. | 2412.12091 | null |
2024-12-16 | IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations | Zhibing Li et.al. | 2412.12083 | null |
2024-12-16 | A LoRA is Worth a Thousand Pictures | Chenxi Liu et.al. | 2412.12048 | null |
2024-12-16 | FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning | Gaojian Wang et.al. | 2412.12032 | link |
2024-12-16 | The entropic optimal (self-)transport problem: Limit distributions for decreasing regularization with application to score function estimation | Gilles Mordant et.al. | 2412.12007 | null |
2024-12-16 | Controllable Shadow Generation with Single-Step Diffusion Models from Synthetic Data | Onur Tasar et.al. | 2412.11972 | null |
2024-12-16 | Multiplexing in Networks and Diffusion | Arun G. Chandrasekhar et.al. | 2412.11957 | null |
2024-12-16 | DRUM: Diffusion-based runoff model for probabilistic flood forecasting | Zhigang Ou et.al. | 2412.11942 | null |
2024-12-13 | Towards a foundation model for heavy-ion collision experiments through point cloud diffusion | Manjunath Omana Kuttan et.al. | 2412.10352 | null |
2024-12-13 | Ensuring Force Safety in Vision-Guided Robotic Manipulation via Implicit Tactile Calibration | Lai Wei et.al. | 2412.10349 | null |
2024-12-13 | A Universal Degradation-based Bridging Technique for Domain Adaptive Semantic Segmentation | Wangkai Li et.al. | 2412.10339 | null |
2024-12-13 | BrushEdit: All-In-One Image Inpainting and Editing | Yaowei Li et.al. | 2412.10316 | null |
2024-12-13 | Coherent 3D Scene Diffusion From a Single RGB Image | Manuel Dahnert et.al. | 2412.10294 | null |
2024-12-13 | TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation | Xingrui Wang et.al. | 2412.10275 | null |
2024-12-13 | Probabilistic Inverse Cameras: Image to 3D via Multiview Geometry | Rishabh Kabra et.al. | 2412.10273 | null |
2024-12-13 | Quantum transport theory for unconventional magnets; interplay of altermagnetism and p-wave magnetism with superconductivity | Tim Kokkeler et.al. | 2412.10236 | null |
2024-12-13 | Motion of Islands of Elastic Thin Films in the Dewetting Regime | Gianni Dal Maso et.al. | 2412.10222 | null |
2024-12-13 | Learning Complex Non-Rigid Image Edits from Multimodal Conditioning | Nikolai Warner et.al. | 2412.10219 | null |
2024-12-12 | FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion | Haonan Qiu et.al. | 2412.09626 | null |
2024-12-12 | Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors | Yue Feng et.al. | 2412.09625 | null |
2024-12-12 | OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation | Weiqi Li et.al. | 2412.09623 | null |
2024-12-12 | LoRACLR: Contrastive Adaptation for Customization of Diffusion Models | Enis Simsar et.al. | 2412.09622 | null |
2024-12-12 | SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training | Dongting Hu et.al. | 2412.09619 | null |
2024-12-12 | EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | Zhuofan Zong et.al. | 2412.09618 | null |
2024-12-12 | Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG | Kavana Venkatesh et.al. | 2412.09614 | null |
2024-12-12 | LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors | Yabo Chen et.al. | 2412.09597 | null |
2024-12-12 | Probing a diffuse flux of axion-like particles from galactic supernovae with neutrino water Cherenkov detectors | David Alonso-González et.al. | 2412.09595 | null |
2024-12-12 | Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion | Zexin He et.al. | 2412.09593 | null |
2024-12-11 | ObjectMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation | Daniel Winter et.al. | 2412.08645 | null |
2024-12-11 | Generative Semantic Communication: Architectures, Technologies, and Applications | Jinke Ren et.al. | 2412.08642 | null |
2024-12-11 | DMin: Scalable Training Data Influence Estimation for Diffusion Models | Huawei Lin et.al. | 2412.08637 | link |
2024-12-11 | Multimodal Latent Language Modeling with Next-Token Diffusion | Yutao Sun et.al. | 2412.08635 | link |
2024-12-11 | FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models | Vladimir Kulikov et.al. | 2412.08629 | link |
2024-12-11 | Mel-Refine: A Plug-and-Play Approach to Refine Mel-Spectrogram in Audio Generation | Hongming Guo et.al. | 2412.08577 | null |
2024-12-11 | TryOffAnyone: Tiled Cloth Generation from a Dressed Person | Ioannis Xarchakos et.al. | 2412.08573 | link |
2024-12-11 | Sketch2Sound: Controllable Audio Generation via Time-Varying Signals and Sonic Imitations | Hugo Flores García et.al. | 2412.08550 | null |
2024-12-11 | Phenomenology of Neutrino-Dark Matter Interaction in DSNB and AGN | Po-Yan Tseng et.al. | 2412.08537 | null |
2024-12-11 | Limited thermal and spin transport in a dissipative superfluid junction | Meng-Zi Huang et.al. | 2412.08525 | null |
2024-12-10 | Video Motion Transfer with Diffusion Transformers | Alexander Pondaven et.al. | 2412.07776 | link |
2024-12-10 | Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets | Zhen Liu et.al. | 2412.07775 | null |
2024-12-10 | From Slow Bidirectional to Fast Causal Video Generators | Tianwei Yin et.al. | 2412.07772 | null |
2024-12-10 | From an Image to a Scene: Learning to Imagine the World from a Million 360 Videos | Matthew Wallingford et.al. | 2412.07770 | link |
2024-12-10 | Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds | Xiaoyu Xiang et.al. | 2412.07766 | null |
2024-12-10 | Repurposing Pre-trained Video Diffusion Models for Event-based Video Interpolation | Jingxi Chen et.al. | 2412.07761 | null |
2024-12-10 | SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints | Jianhong Bai et.al. | 2412.07760 | link |
2024-12-10 | 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation | Xiao Fu et.al. | 2412.07759 | null |
2024-12-10 | PortraitTalk: Towards Customizable One-Shot Audio-to-Talking Face Generation | Fatemeh Nazarieh et.al. | 2412.07754 | null |
2024-12-10 | Structural, Electronic, and Li-ion Adsorption Properties of PolyPyGY Explored by First-Principles and Machine Learning Simulations: A New Multi-Ringed 2D Carbon Allotrope | K. A. L. Lima et.al. | 2412.07753 | null |
2024-12-09 | [MASK] is All You Need | Vincent Tao Hu et.al. | 2412.06787 | link |
2024-12-09 | Retrieving Semantics from the Deep: an RAG Solution for Gesture Synthesis | M. Hamza Mughal et.al. | 2412.06786 | null |
2024-12-09 | Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation | Ruihan Gao et.al. | 2412.06785 | link |
2024-12-09 | CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction | Zhefei Gong et.al. | 2412.06782 | null |
2024-12-09 | Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation | Nicolas Dufour et.al. | 2412.06781 | link |
2024-12-09 | Diverse Score Distillation | Yanbo Xu et.al. | 2412.06780 | null |
2024-12-09 | Visual Lexicon: Rich Image Features in Language Space | XuDong Wang et.al. | 2412.06774 | null |
2024-12-09 | Interface dynamics in a degenerate Cahn-Hilliard model for viscoelastic phase separation | Katharina Hopf et.al. | 2412.06762 | null |
2024-12-09 | Speckle imaging with blind source separation and total variation deconvolution | Randy Bartels et.al. | 2412.06755 | null |
2024-12-09 | InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention | Howard Zhang et.al. | 2412.06753 | null |
2024-12-06 | Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories | Susung Hong et.al. | 2412.05279 | null |
2024-12-06 | Birth and Death of a Rose | Chen Geng et.al. | 2412.05278 | null |
2024-12-06 | MotionFlow: Attention-Driven Motion Transfer in Video Diffusion Models | Tuna Han Salih Meral et.al. | 2412.05275 | null |
2024-12-06 | Mind the Time: Temporally-Controlled Multi-Event Video Generation | Ziyi Wu et.al. | 2412.05263 | null |
2024-12-06 | Extrapolated Urban View Synthesis Benchmark | Xiangyu Han et.al. | 2412.05256 | link |
2024-12-06 | A kinetically constrained model exhibiting non-linear diffusion and jamming | Abhishek Raj et.al. | 2412.05231 | null |
2024-12-06 | Diffusion cascade in a model of interacting random walkers | Abhishek Raj et.al. | 2412.05222 | null |
2024-12-06 | Go-or-Grow Models in Biology: a Monster on a Leash | R. Thiessen et.al. | 2412.05191 | null |
2024-12-06 | DNF: Unconditional 4D Generation with Dictionary-based Neural Fields | Xinyi Zhang et.al. | 2412.05161 | null |
2024-12-06 | Learning Hidden Physics and System Parameters with Deep Operator Networks | Vijay Kag et.al. | 2412.05133 | null |
2024-12-05 | PaintScene4D: Consistent 4D Scene Generation from Text Prompts | Vinayak Gupta et.al. | 2412.04471 | null |
2024-12-05 | Turbo3D: Ultra-fast Text-to-3D Generation | Hanzhe Hu et.al. | 2412.04470 | null |
2024-12-05 | 4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion | Chaoyang Wang et.al. | 2412.04462 | null |
2024-12-05 | LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors | Yusuf Dalva et.al. | 2412.04460 | null |
2024-12-05 | Four-Plane Factorized Video Autoencoders | Mohammed Suhail et.al. | 2412.04452 | null |
2024-12-05 | MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation | Longtao Zheng et.al. | 2412.04448 | null |
2024-12-05 | DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models | Yizhuo Li et.al. | 2412.04446 | null |
2024-12-05 | Learning Artistic Signatures: Symmetry Discovery and Style Transfer | Emma Finn et.al. | 2412.04441 | null |
2024-12-05 | Structure of undercompressive shock waves in three-phase flow in porous media | L. F. Lozano et.al. | 2412.04439 | null |
2024-12-05 | Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation | Yuying Ge et.al. | 2412.04432 | link |
2024-12-04 | Navigation World Models | Amir Bar et.al. | 2412.03572 | null |
2024-12-04 | MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation | Zehuan Huang et.al. | 2412.03558 | null |
2024-12-04 | Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention | Hannan Lu et.al. | 2412.03520 | null |
2024-12-04 | NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images | Lingen Li et.al. | 2412.03517 | null |
2024-12-04 | Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion | Shengyuan Zhang et.al. | 2412.03515 | link |
2024-12-04 | Distillation of Diffusion Features for Semantic Correspondence | Frank Fundel et.al. | 2412.03512 | null |
2024-12-04 | Testing the Universality of Self-Organized Criticality in Galactic, Extra-Galactic, and Black-Hole Systems | Markus Aschwanden et.al. | 2412.03499 | null |
2024-12-04 | TRENDy: Temporal Regression of Effective Non-linear Dynamics | Matthew Ricci et.al. | 2412.03496 | null |
2024-12-04 | Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective | Neta Shaul et.al. | 2412.03487 | null |
2024-12-04 | Universal Constants and Energy Integral in Self-Organized Criticality Systems | Markus Aschwanden et.al. | 2412.03481 | null |
2024-12-03 | Diffusion-based Visual Anagram as Multi-task Learning | Zhiyuan Xu et.al. | 2412.02693 | link |
2024-12-03 | FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation | Kefan Chen et.al. | 2412.02690 | null |
2024-12-03 | SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance | Viet Nguyen et.al. | 2412.02687 | null |
2024-12-03 | Planning-Guided Diffusion Policy Learning for Generalizable Contact-Rich Bimanual Manipulation | Xuanlin Li et.al. | 2412.02676 | null |
2024-12-03 | Scaling limit of first passage percolation geodesics on planar maps | Emmanuel Kammerer et.al. | 2412.02666 | null |
2024-12-03 | Asymptically full measure sets of almost-periodic solutions for the NLS equation | Luca Biasco et.al. | 2412.02648 | null |
2024-12-03 | Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation | Yiftach Edelstein et.al. | 2412.02631 | null |
2024-12-03 | Convergence of a heterogeneous Allen-Cahn equation to weighted mean curvature flow | Likhit Ganedi et.al. | 2412.02567 | null |
2024-12-03 | Bayesian data analysis for sky-averaged 21-cm experiments with contamination from linearly polarised foreground | Emma Shen et.al. | 2412.02552 | null |
2024-12-03 | Unveiling Concept Attribution in Diffusion Models | Quang H. Nguyen et.al. | 2412.02542 | null |
2024-11-29 | Open source Differentiable ODE Solving Infrastructure | Rakshit Kr. Singh et.al. | 2411.19882 | null |
2024-11-29 | Gravity's role in taming the Tayler instability in red giant cores | Domenico G. Meduri et.al. | 2411.19849 | null |
2024-11-29 | Classical transport in a maximally chaotic chain | William Alderson et.al. | 2411.19828 | null |
2024-11-29 | Open and trapping channels in complex resonant media | Romain Rescanieres et.al. | 2411.19818 | null |
2024-11-29 | MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks | Yiming Wu et.al. | 2411.19786 | null |
2024-11-29 | Riemannian Denoising Score Matching for Molecular Structure Optimization with Accurate Energy | Jeheon Woo et.al. | 2411.19769 | null |
2024-11-29 | Insensitizing controls of a volume-surface reaction-diffusion equation with dynamic boundary conditions | Idriss Boutaayamoua et.al. | 2411.19760 | null |
2024-11-29 | TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting | Bojun Xiong et.al. | 2411.19654 | null |
2024-11-29 | Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing | Wenyi Mo et.al. | 2411.19652 | link |
2024-11-29 | CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation | Qixiu Li et.al. | 2411.19650 | null |
2024-11-27 | GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data | Wentao Wang et.al. | 2411.18624 | null |
2024-11-27 | Diffusion Self-Distillation for Zero-Shot Customized Image Generation | Shengqu Cai et.al. | 2411.18616 | null |
2024-11-27 | CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models | Rundi Wu et.al. | 2411.18613 | null |
2024-11-27 | Evaluating and Improving the Effectiveness of Synthetic Chest X-Rays for Medical Image Analysis | Eva Prakash et.al. | 2411.18602 | null |
2024-11-27 | Frequency redistribution and step-size distribution of light scattered by atomic vapor: applications to Lévy flight random walk | Isaac C. Nunes et.al. | 2411.18570 | null |
2024-11-27 | DexDiffuser: Interaction-aware Diffusion Planning for Adaptive Dexterous Manipulation | Zhixuan Liang et.al. | 2411.18562 | null |
2024-11-27 | FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion | Haosen Yang et.al. | 2411.18552 | null |
2024-11-27 | The Rise and Fall of Ideas' Popularity | Piero Mazzarisi et.al. | 2411.18541 | link |
2024-11-27 | Chemical pressure tuning of competing orders in $\textrm{Ba}{1-x}\textrm{Ca}{x}\textrm{Ni}{2}\textrm{As}{2}$ | F. Henssler et.al. | 2411.18536 | null |
2024-11-27 | Spin liquid properties of the kagome material Cu |
F. L. Pratt et.al. | 2411.18518 | null |
2024-11-26 | StableAnimator: High-Quality Identity-Preserving Human Image Animation | Shuyuan Tu et.al. | 2411.17697 | link |
2024-11-26 | ScribbleLight: Single Image Indoor Relighting with Scribbles | Jun Myeong Choi et.al. | 2411.17696 | null |
2024-11-26 | GenDeg: Diffusion-Based Degradation Synthesis for Generalizable All-in-One Image Restoration | Sudarshan Rajagopalan et.al. | 2411.17687 | null |
2024-11-26 | Exclusion processes with non-reversible boundary: hydrodynamics and large deviations | Claudio Landim et.al. | 2411.17653 | null |
2024-11-26 | A robust image encryption scheme based on new 4-D hyperchaotic system and elliptic curve | Yehia Lalili et.al. | 2411.17643 | null |
2024-11-26 | Accelerating Vision Diffusion Transformers with Skip Branches | Guanjie Chen et.al. | 2411.17616 | link |
2024-11-26 | Mixed-State Quantum Denoising Diffusion Probabilistic Model | Gino Kwun et.al. | 2411.17608 | null |
2024-11-26 | VideoDirector: Precise Video Editing via Text-to-Video Models | Yukun Wang et.al. | 2411.17592 | null |
2024-11-26 | IMPROVE: Improving Medical Plausibility without Reliance on HumanValidation -- An Enhanced Prototype-Guided Diffusion Framework | Anurag Shandilya et.al. | 2411.17535 | null |
2024-11-26 | FTMoMamba: Motion Generation with Frequency and Text State Space Models | Chengjian Li et.al. | 2411.17532 | null |
2024-11-25 | The impact of resistivity on the variability of black hole accretion flows | Antonios Nathanail et.al. | 2411.16684 | null |
2024-11-25 | Generative Omnimatte: Learning to Decompose Video into Layers | Yao-Chih Lee et.al. | 2411.16683 | null |
2024-11-25 | Diffusion Features for Zero-Shot 6DoF Object Pose Estimation | Bernd Von Gimborn et.al. | 2411.16668 | null |
2024-11-25 | LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction | Yiran Sun et.al. | 2411.16629 | link |
2024-11-25 | Inference-Time Policy Steering through Human Interactions | Yanwei Wang et.al. | 2411.16627 | null |
2024-11-25 | Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models | Ronghuan Wu et.al. | 2411.16602 | null |
2024-11-25 | Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification | Andre Kassis et.al. | 2411.16598 | link |
2024-11-25 | Sequential data assimilation for PDEs using shape-morphing solutions | Zachary T. Hilliard et.al. | 2411.16593 | null |
2024-11-25 | Rethinking Diffusion for Text-Driven Human Motion Generation | Zichong Meng et.al. | 2411.16575 | null |
2024-11-25 | Representation Collapsing Problems in Vector Quantization | Wenhao Zhao et.al. | 2411.16550 | null |
2024-11-22 | DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving | Bencheng Liao et.al. | 2411.15139 | link |
2024-11-22 | Material Anything: Generating Materials for Any 3D Object via Diffusion | Xin Huang et.al. | 2411.15138 | null |
2024-11-22 | VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement | Daeun Lee et.al. | 2411.15115 | null |
2024-11-22 | Efficient Pruning of Text-to-Image Models: Insights from Pruning Stable Diffusion | Samarth N Ramesh et.al. | 2411.15113 | null |
2024-11-22 | OminiControl: Minimal and Universal Control for Diffusion Transformer | Zhenxiong Tan et.al. | 2411.15098 | link |
2024-11-22 | Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation | Lakshmikar R. Polamreddy et.al. | 2411.15084 | link |
2024-11-22 | The 1D nonlocal Fisher-KPP equation with a top hat kernel. Part 3. The effect of perturbations in the kernel | David John Needham et.al. | 2411.15054 | null |
2024-11-22 | HeadRouter: A Training-free Image Editing Framework for MM-DiTs by Adaptively Routing Attention Heads | Yu Xu et.al. | 2411.15034 | null |
2024-11-22 | FloAt: Flow Warping of Self-Attention for Clothing Animation Generation | Swasti Shreya Mishra et.al. | 2411.15028 | null |
2024-11-22 | 3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes | Jan Held et.al. | 2411.14974 | link |
2024-11-21 | Stable Flow: Vital Layers for Training-Free Image Editing | Omri Avrahami et.al. | 2411.14430 | null |
2024-11-21 | Low-Field Regime of Magnon Transport in Yttrium Iron Garnet | Hossein Taghinejad et.al. | 2411.14428 | null |
2024-11-21 | Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation | Zhuoman Liu et.al. | 2411.14423 | null |
2024-11-21 | Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation | Yuanhao Cai et.al. | 2411.14384 | null |
2024-11-21 | CoNFiLD-inlet: Synthetic Turbulence Inflow Using Generative Latent Diffusion Models with Neural Fields | Xin-Yang Liu et.al. | 2411.14378 | null |
2024-11-21 | Anomalous transport in U(1)-symmetric quantum circuits | Alessandro Summer et.al. | 2411.14357 | null |
2024-11-21 | Enhancing Medical Image Segmentation with Deep Learning and Diffusion Models | Houze Liu et.al. | 2411.14353 | null |
2024-11-21 | Generalized Finite Difference Method for Solving Stochastic Diffusion Equations | Faezeh Nassajian Mojarrad et.al. | 2411.14333 | null |
2024-11-21 | StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart | Jian Shi et.al. | 2411.14295 | null |
2024-11-21 | Guided MRI Reconstruction via Schrödinger Bridge | Yue Wang et.al. | 2411.14269 | null |
2024-11-20 | REDUCIO! Generating 1024 |
Rui Tian et.al. | 2411.13552 | link |
2024-11-20 | A Survey of H I and O VI Absorption Lines in the Outskirts of |
Priscilla Holguin Luna et.al. | 2411.13551 | null |
2024-11-20 | HF-Diff: High-Frequency Perceptual Loss and Distribution Matching for One-Step Diffusion-Based Image Super-Resolution | Shoaib Meraj Sami et.al. | 2411.13548 | null |
2024-11-20 | Identity Preserving 3D Head Stylization with Multiview Score Distillation | Bahri Batuhan Bilecen et.al. | 2411.13536 | null |
2024-11-20 | Dense Suspensions in Rotary Shear | Naveen Kumar Agrawal et.al. | 2411.13463 | null |
2024-11-20 | Sampling and Integration of Logconcave Functions by Algorithmic Diffusion | Yunbum Kook et.al. | 2411.13462 | null |
2024-11-20 | From Prompt Engineering to Prompt Craft | Joseph Lindley et.al. | 2411.13422 | null |
2024-11-20 | Heuristically Adaptive Diffusion-Model Evolutionary Strategy | Benedikt Hartl et.al. | 2411.13420 | null |
2024-11-20 | Adversarial Diffusion Compression for Real-World Image Super-Resolution | Bin Chen et.al. | 2411.13383 | null |
2024-11-20 | New Insights on the High Reconnection Rate and the Diminishment of Ion Outflow | Cheng-Yu Fan et.al. | 2411.13352 | null |
2024-11-19 | Quantum-assisted hλ-adaptive finite element method | R. H. Drebotiy et.al. | 2411.12687 | null |
2024-11-19 | PoM: Efficient Image and Video Generation with the Polynomial Mixer | David Picard et.al. | 2411.12663 | link |
2024-11-19 | Implementation and performance of a fiber-coupled CMOS camera in an ultrafast reflective high-energy electron diffraction experiment | Jonas D. Fortmann et.al. | 2411.12660 | null |
2024-11-19 | Scaling invariance for the diffusion coefficient in a dissipative standard mapping | Edson D. Leonel et.al. | 2411.12648 | null |
2024-11-19 | Improving Controllability and Editability for Pretrained Text-to-Music Generation Models | Yixiao Zhang et.al. | 2411.12641 | null |
2024-11-19 | ChemSICal: Evaluating a Stochastic Chemical Reaction Network for Molecular Multiple Access | Alexander Wietfeld et.al. | 2411.12637 | null |
2024-11-19 | Instant Policy: In-Context Imitation Learning via Graph Diffusion | Vitalis Vosylius et.al. | 2411.12633 | null |
2024-11-19 | Exploring the Manifold of Neural Networks Using Diffusion Geometry | Elliott Abel et.al. | 2411.12626 | null |
2024-11-19 | CHANG-ES XXXV: Cosmic Ray Transport and Magnetic Field Structure of NGC 3556 at 3 GHz | Jianghui Xu et.al. | 2411.12564 | null |
2024-11-19 | When Theory Meets Experiment: What Does it Take to Accurately Predict |
Dietmar Paschek et.al. | 2411.12545 | null |
2024-11-18 | Equivariant spatio-hemispherical networks for diffusion MRI deconvolution | Axel Elaldi et.al. | 2411.11819 | link |
2024-11-18 | Fabrication of Hierarchical Sapphire Nanostructures using Ultrafast Laser Induced Morphology Change | Joshua Cheung et.al. | 2411.11817 | null |
2024-11-18 | Open Catalyst Experiments 2024 (OCx24): Bridging Experiments and Computational Models | Jehad Abed et.al. | 2411.11783 | null |
2024-11-18 | Milstein-type schemes for McKean-Vlasov SDEs driven by Brownian motion and Poisson random measure (with super-linear coefficients) | Sani Biswas et.al. | 2411.11759 | null |
2024-11-18 | Correlated emission lasing in a single quantum dot embedded inside a bimodal photonic crystal cavity | Lavakumar Addepalli et.al. | 2411.11744 | null |
2024-11-18 | Aligning Few-Step Diffusion Models with Dense Reward Difference Learning | Ziyi Zhang et.al. | 2411.11727 | link |
2024-11-18 | Robust Reinforcement Learning under Diffusion Models for Data with Jumps | Chenyang Jiang et.al. | 2411.11697 | null |
2024-11-18 | Active droplets controlled by enzymatic reactions | Jacques Fries et.al. | 2411.11696 | null |
2024-11-18 | Hamiltonian Monte Carlo vs. event-chain Monte Carlo: an appraisal of sampling strategies beyond the diffusive regime | Werner Krauth et.al. | 2411.11690 | null |
2024-11-18 | Conceptwm: A Diffusion Model Watermark for Concept Protection | Liangqi Lei et.al. | 2411.11688 | null |
2024-11-15 | M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation | Sucheng Ren et.al. | 2411.10433 | link |
2024-11-15 | Mitigating Parameter Degeneracy using Joint Conditional Diffusion Model for WECC Composite Load Model in Power Systems | Feiqin Zhu et.al. | 2411.10431 | null |
2024-11-15 | Lower bounds on the top Lyapunov exponent for linear PDEs driven by the 2D stochastic Navier-Stokes equations | Martin Hairer et.al. | 2411.10419 | null |
2024-11-15 | Repurposing Stable Diffusion Attention for Training-Free Unsupervised Interactive Segmentation | Markus Karmann et.al. | 2411.10411 | null |
2024-11-15 | Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion | Haoran Wei et.al. | 2411.10369 | null |
2024-11-15 | Anisotropic Field Theory of Wave Transmission Statistics in Disordered Media | David Gaspard et.al. | 2411.10360 | null |
2024-11-15 | Transmission eigenvalue distribution in disordered media from anisotropic field theory | David Gaspard et.al. | 2411.10355 | null |
2024-11-15 | Probabilistic Prior Driven Attention Mechanism Based on Diffusion Model for Imaging Through Atmospheric Turbulence | Guodong Sun et.al. | 2411.10321 | null |
2024-11-15 | Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting | Ziqi Xie et.al. | 2411.10309 | link |
2024-11-15 | The Unreasonable Effectiveness of Guidance for Diffusion Models | Tim Kaiser et.al. | 2411.10257 | null |
2024-11-14 | MagicQuill: An Intelligent Interactive Image Editing System | Zichen Liu et.al. | 2411.09703 | null |
2024-11-14 | Motion Before Action: Diffusing Object Motion as Manipulation Condition | Yup Su et.al. | 2411.09658 | null |
2024-11-14 | The lowest-radiation environments in the Solar System: new opportunities for underground rare-event searches | Xilin Zhang et.al. | 2411.09634 | null |
2024-11-14 | NEP-MB-pol: A unified machine-learned framework for fast and accurate prediction of water's thermodynamic and transport properties | Ke Xu et.al. | 2411.09631 | link |
2024-11-14 | MICCAI-CDMRI 2023 QuantConn Challenge Findings on Achieving Robust Quantitative Connectivity through Harmonized Preprocessing of Diffusion MRI | Nancy R. Newlin et.al. | 2411.09618 | link |
2024-11-14 | Carl Wirtz' article from 1924 in Astronomische Nachrichten on the radial motions of spiral nebulae | Tom Richtler et.al. | 2411.09606 | null |
2024-11-14 | Numerical prediction of the steady-state distribution under stochastic resetting from measurements | Ron Vatash et.al. | 2411.09563 | null |
2024-11-14 | FlowNav: Learning Efficient Navigation Policies via Conditional Flow Matching | Samiran Gode et.al. | 2411.09524 | null |
2024-11-14 | Enhanced HLLEM and HLL-CPS schemes for all Mach number flows based using anti-diffusion coefficients | A. Gogoi et.al. | 2411.09509 | null |
2024-11-14 | Golden Noise for Diffusion Models: A Learning Framework | Zikai Zhou et.al. | 2411.09502 | link |
2024-11-13 | 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization | Mijeong Kim et.al. | 2411.08879 | null |
2024-11-13 | On the number of crossings and bouncings of a diffusion at a sticky threshold | Alexis Anagnostakis et.al. | 2411.08846 | null |
2024-11-13 | Offline Adaptation of Quadruped Locomotion using Diffusion Models | Reece O'Mahoney et.al. | 2411.08832 | null |
2024-11-13 | Fluctuations of driven probes reveal nonequilibrium transitions in complex fluids | Danilo Forastiere et.al. | 2411.08817 | null |
2024-11-13 | A combined diffusion/rate equation model to describe charge generation in phase-separated donor-acceptor blends | Phillip Teschner et.al. | 2411.08812 | null |
2024-11-13 | A novel imaging setup for hybrid radiotherapy tailored PET/MR in patients with head and neck cancer | R. M. Winter et.al. | 2411.08783 | null |
2024-11-13 | Particle acceleration and multi-messenger radiation from Ultra-Luminous X-ray Sources -- A new class of Galactic PeVatrons | Enrico Peretti et.al. | 2411.08762 | null |
2024-11-13 | Berry-Esseen bounds for large-time asymptotics of one-dimensional diffusion processes via Malliavin-Stein method | Seiichiro Kusuoka et.al. | 2411.08725 | null |
2024-11-13 | Starburst heating and synthetic ion column densities in multiphase galactic outflows | D. Villarruel et.al. | 2411.08704 | null |
2024-11-13 | MikuDance: Animating Character Art with Mixed Motion Dynamics | Jiaxu Zhang et.al. | 2411.08656 | null |
2024-11-12 | Scaling Properties of Diffusion Models for Perceptual Tasks | Rahul Ravishankar et.al. | 2411.08034 | null |
2024-11-12 | GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation | Yushi Lan et.al. | 2411.08033 | null |
2024-11-12 | Commissioning of the 2.6 m tall two-phase xenon time projection chamber of Xenoscope | M. Adrover et.al. | 2411.08022 | null |
2024-11-12 | Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings | Aditya Sanghi et.al. | 2411.08017 | link |
2024-11-12 | Quantitative Phase-Field Modeling of Rapid Alloy Solidification | Kaihua Ji et.al. | 2411.07953 | null |
2024-11-12 | Microscopic fluctuations in the spreading fronts of circular wetting liquid droplets | J. M. Marcos et.al. | 2411.07923 | null |
2024-11-12 | When Randomness Beats Redundancy: Insights into the Diffusion of Complex Contagions | Allison Wan et.al. | 2411.07907 | null |
2024-11-12 | Diverse capability and scaling of diffusion and auto-regressive models when learning abstract rules | Binxu Wang et.al. | 2411.07873 | null |
2024-11-12 | API Phonons: Python Interfaces for Phonon Transport Modeling | Xin Qian et.al. | 2411.07774 | link |
2024-11-12 | Novel View Synthesis with Pixel-Space Diffusion Models | Noam Elata et.al. | 2411.07765 | null |
2024-11-11 | Score-based generative diffusion with "active" correlated noise sources | Alexandra Lamtyugina et.al. | 2411.07233 | null |
2024-11-11 | Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models | Yoad Tewel et.al. | 2411.07232 | null |
2024-11-11 | DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID | Nyle Siddiqui et.al. | 2411.07205 | link |
2024-11-11 | Crossover from inhomogeneous to homogeneous response of a resonantly driven hBN quantum emitter | Domitille Gérard et.al. | 2411.07202 | null |
2024-11-11 | OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision | Cong Wei et.al. | 2411.07199 | null |
2024-11-11 | Lifetime-Limited and Tunable Emission from Charge-Stabilized Nickel Vacancy Centers in Diamond | I. M. Morris et.al. | 2411.07196 | null |
2024-11-11 | More Expressive Attention with Negative Weights | Ang Lv et.al. | 2411.07176 | link |
2024-11-11 | Edify 3D: Scalable High-Quality 3D Asset Generation | NVIDIA et.al. | 2411.07135 | null |
2024-11-11 | Zero-sum Dynkin games under common and independent Poisson constraints | David Hobson et.al. | 2411.07134 | null |
2024-11-11 | Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models | NVIDIA et.al. | 2411.07126 | null |
2024-11-08 | Model for Diffusion Limited Crystal Growth with and without Growth Rate Dispersion | Douglas A. Barlow et.al. | 2411.05768 | null |
2024-11-08 | Tract-RLFormer: A Tract-Specific RL policy based Decoder-only Transformer Network | Ankita Joshi et.al. | 2411.05757 | null |
2024-11-08 | StdGEN: Semantic-Decomposed 3D Character Generation from Single Images | Yuze He et.al. | 2411.05738 | null |
2024-11-08 | Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models | Jia-Hong Huang et.al. | 2411.05706 | null |
2024-11-08 | Improving Molecular Graph Generation with Flow Matching and Optimal Transport | Xiaoyang Hou et.al. | 2411.05676 | null |
2024-11-08 | Ultra-high-energy cosmic rays from ultra-fast outflows of active galactic nuclei | Domenik Ehlert et.al. | 2411.05667 | null |
2024-11-08 | The rush to the poles and the role of magnetic buoyancy in the solar dynamo | Simon Cloutier et.al. | 2411.05623 | null |
2024-11-08 | Probing the Galactic neutrino flux at neutrino energies above 200 TeV with the Baikal Gigaton Volume Detector | V. A. Allakhverdyan et.al. | 2411.05608 | null |
2024-11-08 | Parameterized Voter Relevance in Facility Location Games with Tree-Shaped Invitation Graphs | Ryoto Ando et.al. | 2411.05574 | null |
2024-11-08 | Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion | Nan Song et.al. | 2411.05544 | null |
2024-11-07 | SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models | Muyang Li et.al. | 2411.05007 | link |
2024-11-07 | ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing | Jun-Kun Chen et.al. | 2411.05006 | null |
2024-11-07 | Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models | Shuhong Zheng et.al. | 2411.05005 | null |
2024-11-07 | ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning | David Junhao Zhang et.al. | 2411.05003 | null |
2024-11-07 | SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation | Koichi Namekata et.al. | 2411.04989 | null |
2024-11-07 | Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification | Mischa Dombrowski et.al. | 2411.04956 | null |
2024-11-07 | DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion | Wenqiang Sun et.al. | 2411.04928 | null |
2024-11-07 | MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views | Yuedong Chen et.al. | 2411.04924 | link |
2024-11-07 | Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion | Kaizhe Hu et.al. | 2411.04919 | link |
2024-11-07 | Sharp extinction rates for positive solutions of fast diffusion equations | Tobias König et.al. | 2411.04783 | null |
2024-11-06 | Community Forensics: Using Thousands of Generators to Train Fake Image Detectors | Jeongsoo Park et.al. | 2411.04125 | null |
2024-11-06 | Manifold Diffusion Geometry: Curvature, Tangent Spaces, and Dimension | Iolo Jones et.al. | 2411.04100 | link |
2024-11-06 | Simulation of solar energetic particle events originated from coronal mass ejection shocks with a data-driven physics-based transport model | Lei Cheng et.al. | 2411.04095 | null |
2024-11-06 | A Multi-level Monte Carlo simulation for invariant distribution of Markovian switching Lévy-driven SDEs with super-linearly growth coefficients | Hoang-Viet Nguyen et.al. | 2411.04081 | null |
2024-11-06 | The Lorentz Gas in a Mean-Field Potential: Weak Coupling and Diffusive Regime | Dominik Nowak et.al. | 2411.04076 | null |
2024-11-06 | On a diffuse interface model for diblock copolymers interacting with an electric field | Helmut Abels et.al. | 2411.04074 | null |
2024-11-06 | Imaging heat transport in suspended diamond nanostructures with integrated spin defect thermometers | Valentin Goblot et.al. | 2411.04065 | null |
2024-11-06 | Space-Time Spectral Element Tensor Network Approach for Time Dependent Convection Diffusion Reaction Equation with Variable Coefficients | Dibyendu Adak et.al. | 2411.04026 | null |
2024-11-06 | Synomaly Noise and Multi-Stage Diffusion: A Novel Approach for Unsupervised Anomaly Detection in Ultrasound Imaging | Yuan Bi et.al. | 2411.04004 | null |
2024-11-06 | ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy | Chenrui Tie et.al. | 2411.03990 | null |
2024-11-05 | Production and propagation of secondary antideuteron in the Galaxy | Luis Fernando Galicia Cruztitla et.al. | 2411.03298 | null |
2024-11-05 | DiT4Edit: Diffusion Transformer for Image Editing | Kunyu Feng et.al. | 2411.03286 | null |
2024-11-05 | DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models | Ying Zhou et.al. | 2411.03250 | null |
2024-11-05 | bursty_dynamics: A Python Package for Exploring the Temporal Properties of Longitudinal Data | Alisha Angdembe et.al. | 2411.03210 | null |
2024-11-05 | Electron-irradiation effects on monolayer MoS2 at elevated temperatures | Carsten Speckmann et.al. | 2411.03200 | null |
2024-11-05 | On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models | Tariq Berrada Ifriqi et.al. | 2411.03177 | null |
2024-11-05 | Unleashing the power of novel conditional generative approaches for new materials discovery | Lev Novitskiy et.al. | 2411.03156 | link |
2024-11-05 | A numerical study on temperature destratification induced by bubble plumes in idealized reservoirs | Yiran Li et.al. | 2411.03120 | null |
2024-11-05 | Coupling Methods and Applications on the Exponential Contractivity for Path Dependent McKean-Vlasov SDEs | Xing Huang et.al. | 2411.03104 | null |
2024-11-05 | Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising | Tao Huang et.al. | 2411.03053 | null |
2024-11-04 | Adaptive Caching for Faster Video Generation with Diffusion Transformers | Kumara Kahatapitiya et.al. | 2411.02397 | null |
2024-11-04 | Training-free Regional Prompting for Diffusion Transformers | Anthony Chen et.al. | 2411.02395 | link |
2024-11-04 | How Far is Video Generation from World Model: A Physical Law Perspective | Bingyi Kang et.al. | 2411.02385 | null |
2024-11-04 | Quantum Ornstein-Zernike Theory for Two-Temperature Two-Component Plasmas | Zachary A. Johnson et.al. | 2411.02363 | null |
2024-11-04 | MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D | Wei Cheng et.al. | 2411.02336 | null |
2024-11-04 | Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition | Xinkai Liu et.al. | 2411.02334 | null |
2024-11-04 | Non-parametric Inference for Diffusion Processes: A Computational Approach via Bayesian Inversion for PDEs | Maximilian Kruse et.al. | 2411.02324 | null |
2024-11-04 | LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation | Mufei Li et.al. | 2411.02322 | link |
2024-11-04 | Convolutional neural networks applied to differential dynamic microscopy reduces noise when quantifying heterogeneous dynamics | Gildardo Martinez et.al. | 2411.02314 | null |
2024-11-04 | Grouped Discrete Representation for Object-Centric Learning | Rongzhen Zhao et.al. | 2411.02299 | null |
2024-10-31 | Bridging Geometric States via Geometric Diffusion Bridge | Shengjie Luo et.al. | 2410.24220 | null |
2024-10-31 | DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion | Weicai Ye et.al. | 2410.24203 | link |
2024-10-31 | AR-Pro: Counterfactual Explanations for Anomaly Repair with Formal Properties | Xiayan Ji et.al. | 2410.24178 | null |
2024-10-31 | Redefining in Dictionary: Towards a Enhanced Semantic Understanding of Creative Generation | Fu Feng et.al. | 2410.24160 | null |
2024-10-31 | Scaling Concept With Text-Guided Diffusion Models | Chao Huang et.al. | 2410.24151 | null |
2024-10-31 | Nonlinear Two-Level Schwarz Methods: A Parallel Implementation in FROSch | Alexander Heinlein et.al. | 2410.24138 | null |
2024-10-31 | Modeling Brownian Motion as a Timelapse of the Physical, Persistent, Trajectory | Ludovico Cademartiri et.al. | 2410.24137 | null |
2024-10-31 | 3D-ViTac: Learning Fine-Grained Manipulation with Visuo-Tactile Sensing | Binghao Huang et.al. | 2410.24091 | null |
2024-10-31 | Deep Chandra Observations of NGC 5728. III: Probing the High-Resolution X-ray Morphology and Multiphase ISM Interactions in the Circumnuclear Region | Anna Trindade Falcao et.al. | 2410.24061 | null |
2024-10-31 | Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure | Xiang Li et.al. | 2410.24060 | link |
2024-10-30 | ReferEverything: Towards Segmenting Everything We Can Speak of in Videos | Anurag Bagchi et.al. | 2410.23287 | null |
2024-10-30 | Provable acceleration for diffusion models under minimal assumptions | Gen Li et.al. | 2410.23285 | null |
2024-10-30 | RelationBooth: Towards Relation-Aware Customized Object Generation | Qingyu Shi et.al. | 2410.23280 | null |
2024-10-30 | SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation | Yining Hong et.al. | 2410.23277 | null |
2024-10-30 | Multi-student Diffusion Distillation for Better One-step Generators | Yanke Song et.al. | 2410.23274 | null |
2024-10-30 | Chapman-Enskog theory for nearly integrable quantum gases | Maciej Łebek et.al. | 2410.23209 | null |
2024-10-30 | Diffusive shock acceleration of dust grains at supernova remnants | P. Cristofari et.al. | 2410.23190 | null |
2024-10-30 | CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense | Mingkun Zhang et.al. | 2410.23091 | link |
2024-10-30 | Controlling Language and Diffusion Models by Transporting Activations | Pau Rodriguez et.al. | 2410.23054 | link |
2024-10-30 | Regularity and stability for the Gibbs conditioning principle on path space via McKean-Vlasov control | Louis-Pierre Chaintron et.al. | 2410.23016 | null |
2024-10-29 | Driving forces in cell migration and pattern formation in a soft tissue | Amabile Tatone et.al. | 2410.22273 | null |
2024-10-29 | Surface reconstruction from point cloud using a semi-Lagrangian scheme with local interpolator | Silvia Preda et.al. | 2410.22205 | null |
2024-10-29 | Confinement of relativistic particles in the vicinity of accelerators: a key for understanding the anomalies in secondary cosmic rays | Rui-zhi Yang et.al. | 2410.22199 | null |
2024-10-29 | An alternating low-rank projection approach for partial differential equations with random inputs | Guanjie Wang et.al. | 2410.22183 | null |
2024-10-29 | Capacity Control is an Effective Memorization Mitigation Mechanism in Text-Conditional Diffusion Models | Raman Dutt et.al. | 2410.22149 | link |
2024-10-29 | Averaging principle for multiscale controlled jump diffusions and associated nonlocal HJB equations | Qi Zhang et.al. | 2410.22141 | null |
2024-10-30 | Thermodynamic uncertainty relation for systems with active Ornstein-Uhlenbeck particles | Hyeong-Tark Han et.al. | 2410.22126 | null |
2024-10-29 | TractShapeNet: Efficient Multi-Shape Learning with 3D Tractography Point Clouds | Yui Lo et.al. | 2410.22099 | link |
2024-10-29 | Generalized arcsine laws for a sluggish random walker with subdiffusive growth | Giuseppe Del Vecchio Del Vecchio et.al. | 2410.22097 | null |
2024-10-29 | Variational inference for pile-up removal at hadron colliders with diffusion models | Malte Algren et.al. | 2410.22074 | null |
2024-10-28 | On Inductive Biases That Enable Generalization of Diffusion Transformers | Jie An et.al. | 2410.21273 | link |
2024-10-28 | One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation | Zhendong Wang et.al. | 2410.21257 | null |
2024-10-28 | On learning higher-order cumulants in diffusion models | Gert Aarts et.al. | 2410.21212 | null |
2024-10-28 | Trajectory Flow Matching with Applications to Clinical Time Series Modeling | Xi Zhang et.al. | 2410.21154 | link |
2024-10-28 | Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences | Zhihao Zhao et.al. | 2410.21130 | null |
2024-10-28 | Tree-Wasserstein Distance for High Dimensional Data with a Latent Feature Hierarchy | Ya-Wei Eileen Lin et.al. | 2410.21107 | null |
2024-10-28 | Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models | Wenda Li et.al. | 2410.21088 | link |
2024-10-28 | Confined active particles with spatially dependent Lorentz force: an odd twist to the "best Fokker-Planck approximation" | René Wittmann et.al. | 2410.21087 | null |
2024-10-28 | Federated Time Series Generation on Feature and Temporally Misaligned Data | Chenrui Fan et.al. | 2410.21072 | null |
2024-10-28 | Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework | Vladimir Arkhipkin et.al. | 2410.21061 | link |
2024-10-25 | Adversarial Environment Design via Regret-Guided Diffusion Models | Hojun Chung et.al. | 2410.19715 | null |
2024-10-25 | Sylvester-Preconditioned Adaptive-Rank Implicit Time Integrators for Advection-Diffusion Equations with Inhomogeneous Coefficients | Hamad El Kahza et.al. | 2410.19662 | null |
2024-10-25 | DiffGS: Functional Gaussian Splatting Diffusion | Junsheng Zhou et.al. | 2410.19657 | null |
2024-10-25 | Planning-Aware Diffusion Networks for Enhanced Motion Forecasting in Autonomous Driving | Liu Yunhao et.al. | 2410.19639 | null |
2024-10-25 | Improved performance of polycrystalline antiferromagnet/ferromagnet stack by nitrogen assisted deposition | Y. Khaydukov et.al. | 2410.19620 | null |
2024-10-25 | Diffusion models for lattice gauge field simulations | Qianteng Zhu et.al. | 2410.19602 | null |
2024-10-25 | Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time Series | Ilan Naiman et.al. | 2410.19538 | null |
2024-10-25 | Ensemble Data Assimilation for Particle-based Methods | Marius Duvillard et.al. | 2410.19525 | null |
2024-10-25 | Nutation-orbit resonances: The origin of the chaotic rotation of Hyperion and the barrel instability | Max Goldberg et.al. | 2410.19518 | null |
2024-10-25 | Physics-based inverse modeling of battery degradation with Bayesian methods | Micha C. J. Philipp et.al. | 2410.19478 | null |
2024-10-24 | MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms | Ling-Hao Chen et.al. | 2410.18977 | null |
2024-10-24 | 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation | Hansheng Chen et.al. | 2410.18974 | link |
2024-10-24 | On the Crucial Role of Initialization for Matrix Factorization | Bingcong Li et.al. | 2410.18965 | null |
2024-10-24 | Stable Consistency Tuning: Understanding and Improving Consistency Models | Fu-Yun Wang et.al. | 2410.18958 | link |
2024-10-24 | Generation of synthetic financial time series by diffusion models | Tomonori Takahashi et.al. | 2410.18897 | null |
2024-10-24 | Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences | Weijian Luo et.al. | 2410.18881 | null |
2024-10-24 | Diffusion of impurities in a moderately dense confined granular gas | Rubén Gómez González et.al. | 2410.18874 | null |
2024-10-24 | On the mean-field limit of diffusive games through the master equation: extreme value analysis | Erhan Bayraktar et.al. | 2410.18869 | null |
2024-10-24 | The Cat and Mouse Game: The Ongoing Arms Race Between Diffusion Models and Detection Methods | Linda Laurier et.al. | 2410.18866 | null |
2024-10-24 | A diffusion MRI model for random walks confined on cylindrical surfaces: Towards non-invasive quantification of myelin sheath radius | Erick J Canales-Rodríguez et.al. | 2410.18842 | null |
2024-10-23 | DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes | Hengwei Bian et.al. | 2410.18084 | null |
2024-10-23 | Prioritized Generative Replay | Renhao Wang et.al. | 2410.18082 | null |
2024-10-23 | Training Free Guided Flow Matching with Optimal Control | Luran Wang et.al. | 2410.18070 | null |
2024-10-23 | EON: A practical energy-preserving rough diffuse BRDF | Jamie Portsmouth et.al. | 2410.18026 | null |
2024-10-23 | Random space-time sampling and reconstruction of sparse bandlimited graph diffusion field | Longxiu Huang et.al. | 2410.18005 | null |
2024-10-23 | Optical Generative Models | Shiqi Chen et.al. | 2410.17970 | null |
2024-10-23 | A Wavelet Diffusion GAN for Image Super-Resolution | Lorenzo Aloisi et.al. | 2410.17966 | null |
2024-10-23 | Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation | Wenfang Yao et.al. | 2410.17918 | link |
2024-10-23 | Scaling Diffusion Language Models via Adaptation from Autoregressive Models | Shansan Gong et.al. | 2410.17891 | link |
2024-10-23 | Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech | Danilo de Oliveira et.al. | 2410.17834 | null |
2024-10-22 | Ergodic Risk Sensitive Control of Markovian Multiclass Many-Server Queues with Abandonment | Sumith Reddy Anugu et.al. | 2410.17205 | null |
2024-10-22 | Reinforcement learning on structure-conditioned categorical diffusion for protein inverse folding | Yasha Ektefaie et.al. | 2410.17173 | link |
2024-10-22 | On Lyapunov Conditions for the Well-Posedness of McKean-Vlasov Stochastic Differential Delay Equations | Dan Noelck et.al. | 2410.17120 | null |
2024-10-22 | Dust ring and gap formation by gas flow induced by low-mass planets embedded in protoplanetary disks |
Ayumu Kuwahara et.al. | 2410.16996 | null |
2024-10-22 | DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization | Haowei Zhu et.al. | 2410.16942 | null |
2024-10-22 | Hierarchical Clustering for Conditional Diffusion in Image Generation | Jorge da Silva Goncalves et.al. | 2410.16910 | link |
2024-10-22 | MBD: Multi b-value Denoising of Diffusion Magnetic Resonance Images | Jakub Jurek et.al. | 2410.16898 | null |
2024-10-22 | VistaDream: Sampling multiview consistent images for single-view scene reconstruction | Haiping Wang et.al. | 2410.16892 | null |
2024-10-22 | Inverse first-passage problems of a diffusion with resetting | Mario Abundo et.al. | 2410.16889 | null |
2024-10-22 | Accelerated Quantum Circuit Monte-Carlo Simulation for Heavy Quark Thermalization | Wenyang Qian et.al. | 2410.16863 | null |
2024-10-21 | MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors | Honghua Chen et.al. | 2410.16272 | null |
2024-10-21 | 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors | Xi Liu et.al. | 2410.16266 | null |
2024-10-21 | Role of obstacle softness in the diffusive behavior of active Particles | Ankit Gupta et.al. | 2410.16223 | null |
2024-10-21 | A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data | Simon Deltadahl et.al. | 2410.16177 | null |
2024-10-21 | Validity of Prandtl's boundary layer from the Boltzmann theory | Chanwoo Kim et.al. | 2410.16160 | null |
2024-10-22 | Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models | Giannis Daras et.al. | 2410.16152 | null |
2024-10-21 | Universal Linear Response of the Mean First-Passage Time | Tommer D. Keidar et.al. | 2410.16129 | null |
2024-10-21 | SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation | Xinyi Zhou et.al. | 2410.16119 | null |
2024-10-21 | Continuous Speech Synthesis using per-token Latent Diffusion | Arnon Turetzky et.al. | 2410.16048 | null |
2024-10-21 | The essential m-dissipativity for degenerate infinite dimensional stochastic Hamiltonian systems and applications | Benedikt Eisenhuth et.al. | 2410.15993 | null |
2024-10-18 | A GARCH model with two volatility components and two driving factors | Luca Vincenzo Ballestra et.al. | 2410.14585 | link |
2024-10-18 | Semi-Implicit Lagrangian Voronoi Approximation for Compressible Viscous Fluid Flows | Ondřej Kincl et.al. | 2410.14564 | null |
2024-10-18 | Intrinsic cell-to-cell variance from experimental single-cell motility data | Anton Klimek et.al. | 2410.14561 | null |
2024-10-18 | Multi-modal Pose Diffuser: A Multimodal Generative Conditional Pose Prior | Calvin-Khang Ta et.al. | 2410.14540 | null |
2024-10-18 | Diffusion-based Semi-supervised Spectral Algorithm for Regression on Manifolds | Weichun Xia et.al. | 2410.14539 | null |
2024-10-18 | LEAD: Latent Realignment for Human Motion Diffusion | Nefeli Andreou et.al. | 2410.14508 | null |
2024-10-18 | Reinforcement Learning in Non-Markov Market-Making | Luca Lalor et.al. | 2410.14504 | null |
2024-10-18 | ANT: Adaptive Noise Schedule for Time Series Diffusion Models | Seunghan Lee et.al. | 2410.14488 | link |
2024-10-18 | DRL Optimization Trajectory Generation via Wireless Network Intent-Guided Diffusion Models for Optimizing Resource Allocation | Junjie Wu et.al. | 2410.14481 | null |
2024-10-18 | LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes | Juliette Marrie et.al. | 2410.14462 | null |
2024-10-17 | Diffusing States and Matching Scores: A New Framework for Imitation Learning | Runzhe Wu et.al. | 2410.13855 | link |
2024-10-17 | Influence Functions for Scalable Data Attribution in Diffusion Models | Bruno Mlodozeniec et.al. | 2410.13850 | null |
2024-10-17 | DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control | Yujie Wei et.al. | 2410.13830 | null |
2024-10-17 | Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning | Xiaodan Xing et.al. | 2410.13823 | link |
2024-10-17 | Enhancing universal machine learning potentials with polarizable long-range interactions | Rongzhi Gao et.al. | 2410.13820 | link |
2024-10-17 | ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution | Junhao Gu et.al. | 2410.13807 | null |
2024-10-17 | Arbitrarily-Conditioned Multi-Functional Diffusion for Multi-Physics Emulation | Da Long et.al. | 2410.13794 | null |
2024-10-17 | DPLM-2: A Multimodal Diffusion Protein Language Model | Xinyou Wang et.al. | 2410.13782 | null |
2024-10-17 | Conductance in graphene through double laser barriers and magnetic field | Rachid El Aitouni et.al. | 2410.13771 | null |
2024-10-17 | Probing the Latent Hierarchical Structure of Data via Diffusion Models | Antonio Sclocchi et.al. | 2410.13770 | null |
2024-10-16 | Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts | Hongcheng Gao et.al. | 2410.12777 | link |
2024-10-16 | Should exponential integrators be used for advection-dominated problems? | Lukas Einkemmer et.al. | 2410.12765 | null |
2024-10-16 | SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation | Jaehong Yoon et.al. | 2410.12761 | null |
2024-10-16 | Impact of Ion Mobility on Electron Density and Temperature in Hypersonic Flows | Felipe Martin Rodriguez Fuentes et.al. | 2410.12760 | null |
2024-10-16 | Signature of Vertical Mixing in Hydrogen-dominated Exoplanet Atmospheres | Vikas Soni et.al. | 2410.12737 | null |
2024-10-16 | Smooth Geometry of Diffusion Algebras | Andrés Rubiano et.al. | 2410.12701 | null |
2024-10-16 | Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization | Xingqi Wang et.al. | 2410.12700 | link |
2024-10-16 | AdaptiveDrag: Semantic-Driven Dragging on Diffusion-Based Image Editing | DuoSheng Chen et.al. | 2410.12696 | null |
2024-10-16 | Hamiltonian bridge: A physics-driven generative framework for targeted pattern control | Vishaal Krishnan et.al. | 2410.12665 | null |
2024-10-16 | Constrained Posterior Sampling: Time Series Generation with Hard Constraints | Sai Shankar Narasimhan et.al. | 2410.12652 | null |
2024-10-15 | High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion | Junhwa Hur et.al. | 2410.11838 | null |
2024-10-15 | On the Effectiveness of Dataset Alignment for Fake Image Detection | Anirudh Sundara Rajan et.al. | 2410.11835 | null |
2024-10-15 | Bayesian Experimental Design via Contrastive Diffusions | Jacopo Iollo et.al. | 2410.11826 | link |
2024-10-15 | Improving Long-Text Alignment for Text-to-Image Diffusion Models | Luping Liu et.al. | 2410.11817 | link |
2024-10-15 | SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing | Zhiyuan Zhang et.al. | 2410.11815 | null |
2024-10-15 | Random walks with long-range memory on networks | Ana Gabriela Guerrero-Estrada et.al. | 2410.11814 | null |
2024-10-16 | Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices | Zhiyuan Ma et.al. | 2410.11795 | null |
2024-10-15 | Solving The Dynamic Volatility Fitting Problem: A Deep Reinforcement Learning Approach | Emmanuel Gnabeyeu et.al. | 2410.11789 | null |
2024-10-15 | Measure estimation on a manifold explored by a diffusion process | Vincent Divol et.al. | 2410.11777 | null |
2024-10-15 | Probabilistic Principles for Biophysics and Neuroscience: Entropy Production, Bayesian Mechanics & the Free-Energy Principle | Lancelot Da Costa et.al. | 2410.11735 | null |
2024-10-14 | Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models | Jingzhi Bao et.al. | 2410.10821 | link |
2024-10-14 | Depth Any Video with Scalable Synthetic Data | Honghui Yang et.al. | 2410.10815 | link |
2024-10-14 | HART: Efficient Visual Generation with Hybrid Autoregressive Transformer | Haotian Tang et.al. | 2410.10812 | link |
2024-10-14 | TrajDiffuse: A Conditional Diffusion Model for Environment-Aware Trajectory Prediction | Qingze et.al. | 2410.10804 | link |
2024-10-14 | Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies | Yanjie Ze et.al. | 2410.10803 | link |
2024-10-14 | Boosting Camera Motion Control for Video Diffusion Transformers | Soon Yau Cheong et.al. | 2410.10802 | null |
2024-10-15 | MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling | Jian Yang et.al. | 2410.10798 | null |
2024-10-14 | Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations | Litu Rout et.al. | 2410.10792 | null |
2024-10-14 | 3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications | Eduardo R. Corral-Soto et.al. | 2410.10782 | null |
2024-10-14 | ControlMM: Controllable Masked Motion Generation | Ekkasit Pinyoanuntapong et.al. | 2410.10780 | null |
2024-10-11 | SceneCraft: Layout-Guided 3D Scene Generation | Xiuyu Yang et.al. | 2410.09049 | link |
2024-10-11 | Linear Convergence of Diffusion Models Under the Manifold Hypothesis | Peter Potaptchik et.al. | 2410.09046 | null |
2024-10-11 | Semantic Score Distillation Sampling for Compositional Text-to-3D Generation | Ling Yang et.al. | 2410.09009 | link |
2024-10-11 | Macrotransport of active particles in periodic channels and fields: rectification and dispersion | Zhiwei Peng et.al. | 2410.09007 | null |
2024-10-11 | WaveDiffusion: Exploring Full Waveform Inversion via Joint Diffusion in the Latent Space | Hanchen Wang et.al. | 2410.09002 | null |
2024-10-11 | Revised Point-Spread Functions for the Atmospheric Imaging Assembly onboard the Solar Dynamics Observatory | Stefan Hofmeister et.al. | 2410.08967 | null |
2024-10-11 | DiffPO: A causal diffusion model for learning distributions of potential outcomes | Yuchen Ma et.al. | 2410.08924 | null |
2024-10-11 | An End-to-End Deep Learning Method for Solving Nonlocal Allen-Cahn and Cahn-Hilliard Phase-Field Models | Yuwei Geng et.al. | 2410.08914 | null |
2024-10-11 | Conditional Generative Models for Contrast-Enhanced Synthesis of T1w and T1 Maps in Brain MRI | Moritz Piening et.al. | 2410.08894 | link |
2024-10-11 | Simulating anisotropic diffusion processes with smoothed particle hydrodynamics | Xiaojing Tang et.al. | 2410.08888 | null |
2024-10-10 | Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision | Shengcao Cao et.al. | 2410.08209 | null |
2024-10-10 | DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models | Xiaoxiao He et.al. | 2410.08207 | null |
2024-10-10 | HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation | Shanyan Guan et.al. | 2410.08192 | null |
2024-10-10 | DifFRelight: Diffusion-Based Facial Performance Relighting | Mingming He et.al. | 2410.08188 | null |
2024-10-10 | Scaling Laws For Diffusion Transformers | Zhengyang Liang et.al. | 2410.08184 | null |
2024-10-10 | ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion | Zitian Zhang et.al. | 2410.08168 | null |
2024-10-10 | DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation | Jiatao Gu et.al. | 2410.08159 | null |
2024-10-10 | Progressive Autoregressive Video Diffusion Models | Desai Xie et.al. | 2410.08151 | link |
2024-10-10 | Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction | Jarrid Rector-Brooks et.al. | 2410.08134 | null |
2024-10-10 | CrackSegDiff: Diffusion Probability Model-based Multi-modal Crack Segmentation | Xiaoyan Jiang et.al. | 2410.08100 | link |
2024-10-09 | Simulating realistic self-interacting dark matter models including small and large-angle scattering | Cenanda Arido et.al. | 2410.07175 | null |
2024-10-09 | IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation | Xinchen Zhang et.al. | 2410.07171 | link |
2024-10-09 | AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation | Yukang Cao et.al. | 2410.07164 | null |
2024-10-09 | InstructG2I: Synthesizing Images from Multimodal Attributed Graphs | Bowen Jin et.al. | 2410.07157 | link |
2024-10-09 | Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis | Bohan Zeng et.al. | 2410.07155 | link |
2024-10-09 | Measuring Minority Carrier Diffusion Length Using High-Injection Scanning Photocurrent Microscopy | Xiujun Lian et.al. | 2410.07089 | null |
2024-10-09 | FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation | Yishu Li et.al. | 2410.07078 | null |
2024-10-09 | A short proof of diffusivity for the directed polymers in the weak disorder phase | Hubert Lacoin et.al. | 2410.07068 | null |
2024-10-09 | Active fluids form system-spanning filamentary networks | Paarth Gulati et.al. | 2410.07058 | null |
2024-10-09 | Transients by Black Hole Formation from Red Supergiants: Impact of Dense Circumstellar Matter | Daichi Tsuna et.al. | 2410.07055 | link |
2024-10-07 | DART: A Diffusion-Based Autoregressive Motion Model for Real-Time Text-Driven Motion Control | Kaifeng Zhao et.al. | 2410.05260 | null |
2024-10-07 | GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting | Yukang Cao et.al. | 2410.05259 | null |
2024-10-07 | SePPO: Semi-Policy Preference Optimization for Diffusion Alignment | Daoan Zhang et.al. | 2410.05255 | link |
2024-10-07 | Tritium-Lean Fusion Power Plants with Asymmetric Deuterium-Tritium Transport and Pumping | J. F. Parisi et.al. | 2410.05238 | null |
2024-10-07 | DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image Registration | Yongtai Zhuo et.al. | 2410.05234 | link |
2024-10-07 | Presto! Distilling Steps and Layers for Accelerating Music Generation | Zachary Novack et.al. | 2410.05167 | null |
2024-10-07 | A Simulation-Free Deep Learning Approach to Stochastic Optimal Control | Mengjian Hua et.al. | 2410.05163 | null |
2024-10-07 | Formation of Anisotropic Polarons in Antimony Selenide | Yijie Shi et.al. | 2410.05155 | null |
2024-10-07 | Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer | Siyuan Hou et.al. | 2410.05151 | null |
2024-10-07 | Leveraging Multimodal Diffusion Models to Accelerate Imaging with Side Information | Timofey Efimov et.al. | 2410.05143 | null |
2024-10-04 | Estimating Body and Hand Motion in an Ego-sensed World | Brent Yi et.al. | 2410.03665 | null |
2024-10-04 | Connecting Lyman- |
M. Riley Owens et.al. | 2410.03660 | null |
2024-10-04 | Geometric Representation Condition Improves Equivariant Molecule Generation | Zian Li et.al. | 2410.03655 | null |
2024-10-04 | Real-World Benchmarks Make Membership Inference Attacks Fail on Diffusion Models | Chumeng Liang et.al. | 2410.03640 | link |
2024-10-04 | Stabilizing the Consistent Quasidiffusion Method with Linear Prolongation | Dean Wang et.al. | 2410.03605 | null |
2024-10-04 | How Discrete and Continuous Diffusion Meet: Comprehensive Analysis of Discrete Diffusion Models via a Stochastic Integral Framework | Yinuo Ren et.al. | 2410.03601 | null |
2024-10-04 | Free boundary problem governed by a non-linear diffusion-convection equation with Neumann condition | Adriana C. Briozzo et.al. | 2410.03564 | null |
2024-10-04 | Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features | Benyuan Meng et.al. | 2410.03558 | link |
2024-10-04 | Seizure freedom after surgical resection of diffusion-weighted MRI abnormalities | Jonathan Horsley et.al. | 2410.03548 | null |
2024-10-04 | Generative Artificial Intelligence for Navigating Synthesizable Chemical Space | Wenhao Gao et.al. | 2410.03494 | link |
2024-10-03 | Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models | Zhengfeng Lai et.al. | 2410.02740 | null |
2024-10-03 | Discovery of three magnetic He-sdOs with SALT | M. Dorsch et.al. | 2410.02737 | null |
2024-10-03 | NETS: A Non-Equilibrium Transport Sampler | Michael S. Albergo et.al. | 2410.02711 | null |
2024-10-03 | SteerDiff: Steering towards Safe Text-to-Image Diffusion Models | Hongxiang Zhang et.al. | 2410.02710 | null |
2024-10-03 | ControlAR: Controllable Image Generation with Autoregressive Models | Zongming Li et.al. | 2410.02705 | link |
2024-10-03 | GUD: Generation with Unified Diffusion | Mathis Gerdes et.al. | 2410.02667 | null |
2024-10-03 | AGN STROM 2: X. The origin of the interband continuum delays in Mrk 817 | Hagai Netzer et.al. | 2410.02652 | null |
2024-10-03 | Undesirable Memorization in Large Language Models: A Survey | Ali Satvaty et.al. | 2410.02650 | null |
2024-10-03 | Efficient calibration of the shifted square-root diffusion model to credit default swap spreads using asymptotic approximations | Ankush Agarwal et.al. | 2410.02645 | null |
2024-10-03 | Diffusion-based Extreme Image Compression with Compressed Feature Initialization | Zhiyuan Li et.al. | 2410.02640 | link |
2024-10-02 | A Catalog of Pulsar X-ray Filaments | Jack T. Dinsmore et.al. | 2410.01807 | null |
2024-10-02 | FabricDiffusion: High-Fidelity Texture Transfer for 3D Garments Generation from In-The-Wild Clothing Images | Cheng Zhang et.al. | 2410.01801 | null |
2024-10-02 | Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space | Yangming Li et.al. | 2410.01796 | null |
2024-10-02 | Dynamical-generative downscaling of climate model ensembles | Ignacio Lopez-Gomez et.al. | 2410.01776 | null |
2024-10-02 | Integrable Matrix Probabilistic Diffusions and the Matrix Stochastic Heat Equation | Alexandre Krajenbrink et.al. | 2410.01764 | null |
2024-10-02 | ImageFolder: Autoregressive Image Generation with Folded Tokens | Xiang Li et.al. | 2410.01756 | link |
2024-10-02 | VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models | Kailai Feng et.al. | 2410.01738 | link |
2024-10-02 | HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration | Yushi Huang et.al. | 2410.01723 | null |
2024-10-02 | COMUNI: Decomposing Common and Unique Video Signals for Diffusion-based Video Generation | Mingzhen Sun et.al. | 2410.01718 | null |
2024-10-02 | COSMIC: Compress Satellite Images Efficiently via Diffusion Compensation | Ziyuan Zhang et.al. | 2410.01698 | link |
2024-09-30 | Inverse Painting: Reconstructing The Painting Process | Bowei Chen et.al. | 2409.20556 | null |
2024-09-30 | COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models | Divyanshu Daiya et.al. | 2409.20502 | null |
2024-09-30 | FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing | Lingling Cai et.al. | 2409.20500 | null |
2024-09-30 | Persistent homology classifies parameter dependence of patterns in Turing systems | Reemon Spector et.al. | 2409.20491 | null |
2024-09-30 | POMONAG: Pareto-Optimal Many-Objective Neural Architecture Generator | Eugenio Lomurno et.al. | 2409.20447 | null |
2024-09-30 | Multiwavelength Galactic Center gamma-ray observations explained by a unified cosmic-ray dynamics model | Andrés Scherer et.al. | 2409.20436 | null |
2024-09-30 | Lateral diffusion in 2-micron InGaAs/GaAsSb superlattice planar diodes using atomic layer deposition of ZnO | Manisha Muduli et.al. | 2409.20406 | null |
2024-09-30 | Conductance properties of an |
Mijanur Islam et.al. | 2409.20395 | null |
2024-09-30 | Topology affects diffusion dynamics of ring polymers in dilute solutions | Prabeen Kumar Pattnayak et.al. | 2409.20386 | null |
2024-09-30 | Symbol-based multilevel block |
Sean Y. Hon et.al. | 2409.20363 | null |
2024-09-27 | PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation | Shaowei Liu et.al. | 2409.18964 | link |
2024-09-27 | Gen Li et.al. | 2409.18959 | null | |
2024-09-27 | ReviveDiff: A Universal Diffusion Model for Restoring Images in Adverse Weather Conditions | Wenfeng Huang et.al. | 2409.18932 | null |
2024-09-27 | AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow | Huizi Yu et.al. | 2409.18924 | null |
2024-09-27 | Unsupervised Low-light Image Enhancement with Lookup Tables and Diffusion Priors | Yunlong Lin et.al. | 2409.18899 | null |
2024-09-27 | Detecting Dataset Abuse in Fine-Tuning Stable Diffusion Models for Text-to-Image Synthesis | Songrui Wang et.al. | 2409.18897 | null |
2024-09-27 | Explainable Artifacts for Synthetic Western Blot Source Attribution | João Phillipe Cardenuto et.al. | 2409.18881 | link |
2024-09-27 | CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition | Zhonglin Sun et.al. | 2409.18876 | link |
2024-09-27 | Emu3: Next-Token Prediction is All You Need | Xinlong Wang et.al. | 2409.18869 | null |
2024-09-27 | MECG-E: Mamba-based ECG Enhancer for Baseline Wander Removal | Kuo-Hsuan Hung et.al. | 2409.18828 | link |
2024-09-26 | FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner | Wenliang Zhao et.al. | 2409.18128 | link |
2024-09-26 | Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction | Jing He et.al. | 2409.18124 | null |
2024-09-26 | EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation | Jiaxiang Tang et.al. | 2409.18114 | null |
2024-09-26 | StackGen: Generating Stable Structures from Silhouettes via Diffusion | Luzhe Sun et.al. | 2409.18098 | null |
2024-09-26 | DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models | Helin Cao et.al. | 2409.18092 | null |
2024-09-26 | Stable Video Portraits | Mirela Ostrek et.al. | 2409.18083 | null |
2024-09-26 | The radio halo in PLCKESZ G171.94 |
Ramananda Santra et.al. | 2409.18075 | null |
2024-09-26 | Low Photon Number Non-Invasive Imaging Through Time-Varying Diffusers | Adrian Makowski et.al. | 2409.18072 | null |
2024-09-26 | Radio-FIR correlation- A probe into cosmic ray propagation in the nearby galaxy IC 342 | M. R. Nasirzadeh et.al. | 2409.17999 | null |
2024-09-26 | Distributed Invariant Unscented Kalman Filter based on Inverse Covariance Intersection with Intermittent Measurements | Zhian Ruan et.al. | 2409.17997 | null |
2024-09-25 | DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion | Yukun Huang et.al. | 2409.17145 | link |
2024-09-25 | Language-oriented Semantic Communication for Image Transmission with Fine-Tuned Diffusion Model | Xinfeng Wei et.al. | 2409.17104 | null |
2024-09-25 | Effects of the internal temperature on vertical mixing and on cloud structures in Ultra Hot Jupiters | Pascal A. Noti et.al. | 2409.17101 | null |
2024-09-25 | Ctrl-GenAug: Controllable Generative Augmentation for Medical Sequence Classification | Xinrui Zhou et.al. | 2409.17091 | null |
2024-09-25 | Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors | Aiping Zhang et.al. | 2409.17058 | link |
2024-09-25 | ControlCity: A Multimodal Diffusion Model Based Approach for Accurate Geospatial Data Generation and Urban Morphology Analysis | Fangshuo Zhou et.al. | 2409.17049 | link |
2024-09-25 | GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design | Phillip Mueller et.al. | 2409.17045 | null |
2024-09-25 | Cloud technologies, firm growth and industry concentration: Evidence from France | Bernardo Caldarola et.al. | 2409.17035 | null |
2024-09-25 | Decomposition of Friction Coefficients to Analyze Hydration Effects on a C ${60}$(OH)${\rm n}$ | Tomoya Iwashita et.al. | 2409.17028 | null |
2024-09-25 | Single Image, Any Face: Generalisable 3D Face Generation | Wenqing Wang et.al. | 2409.16990 | null |
2024-09-18 | Vista3D: Unravel the 3D Darkside of a Single Image | Qiuhong Shen et.al. | 2409.12193 | link |
2024-09-18 | DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control | Zichen Jeff Cui et.al. | 2409.12192 | null |
2024-09-18 | Massively Multi-Person 3D Human Motion Forecasting with Scene Context | Felix B Mueller et.al. | 2409.12189 | link |
2024-09-18 | Blind Deconvolution on Graphs: Exact and Stable Recovery | Chang Ye et.al. | 2409.12164 | null |
2024-09-18 | MoRAG -- Multi-Fusion Retrieval Augmented Generation for Human Motion | Kalakonda Sai Shashank et.al. | 2409.12140 | null |
2024-09-18 | Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance | Jaehoon Joo et.al. | 2409.12099 | null |
2024-09-18 | Uncovering liquid-substrate fluctuation effects on crystal growth and disordered hyperuniformity of two-dimensional materials | S. K. Mkhonta et.al. | 2409.12090 | null |
2024-09-18 | Denoising diffusion models for high-resolution microscopy image restoration | Pamela Osuna-Vargas et.al. | 2409.12078 | null |
2024-09-18 | LEMON: Localized Editing with Mesh Optimization and Neural Shaders | Furkan Mert Algan et.al. | 2409.12024 | null |
2024-09-19 | On some singularly perturbed elliptic systems modeling partial segregation: uniform Hölder estimates and basic properties of the limits | Nicola Soave et.al. | 2409.11976 | null |
2024-09-17 | Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion | Zhenwei Wang et.al. | 2409.11406 | null |
2024-09-17 | A lattice Boltzmann method for Biot's consolidation model of linear poroelasticity | Stephan B. Lunowa et.al. | 2409.11382 | null |
2024-09-17 | Ultrasound Image Enhancement with the Variance of Diffusion Models | Yuxin Zhang et.al. | 2409.11380 | link |
2024-09-17 | OSV: One Step is Enough for High-Quality Image to Video Generation | Xiaofeng Mao et.al. | 2409.11367 | null |
2024-09-17 | Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think | Gonzalo Martin Garcia et.al. | 2409.11355 | link |
2024-09-17 | OmniGen: Unified Image Generation | Shitao Xiao et.al. | 2409.11340 | link |
2024-09-17 | fMRI-3D: A Comprehensive Dataset for Enhancing fMRI-based 3D Reconstruction | Jianxiong Gao et.al. | 2409.11315 | null |
2024-09-17 | DroneDiffusion: Robust Quadrotor Dynamics Learning with Diffusion Models | Avirup Das et.al. | 2409.11292 | null |
2024-09-17 | Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models | Tianqi Chen et.al. | 2409.11219 | null |
2024-09-17 | SDP: Spiking Diffusion Policy for Robotic Manipulation with Learnable Channel-Wise Membrane Thresholds | Zhixing Hou et.al. | 2409.11195 | null |
2024-09-16 | Incorporating Classifier-Free Guidance in Diffusion Model-Based Recommendation | Noah Buchanan et.al. | 2409.10494 | null |
2024-09-16 | SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing | Qi Qian et.al. | 2409.10476 | null |
2024-09-16 | MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion | Lehong Wu et.al. | 2409.10473 | null |
2024-09-16 | Mamba-ST: State Space Model for Efficient Style Transfer | Filippo Botti et.al. | 2409.10385 | link |
2024-09-16 | Taming Diffusion Models for Image Restoration: A Review | Ziwei Luo et.al. | 2409.10353 | null |
2024-09-16 | Fairness, not Emotion, Drives Socioeconomic Decision Making | Rudra Mukhopadhyay et.al. | 2409.10322 | null |
2024-09-16 | Controllability and Inverse Problems for Parabolic Systems with Dynamic Boundary Conditions | S. E. Chorfi et.al. | 2409.10302 | null |
2024-09-16 | On Synthetic Texture Datasets: Challenges, Creation, and Curation | Blaine Hoak et.al. | 2409.10297 | null |
2024-09-16 | ReflectDiffu: Reflect between Emotion-intent Contagion and Mimicry for Empathetic Response Generation via a RL-Diffusion Framework | Jiahao Yuan et.al. | 2409.10289 | link |
2024-09-16 | DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis | Fa-Ting Hong et.al. | 2409.10281 | null |
2024-09-13 | Oxygen Abundance Throughout the Dwarf Starburst IC 10 | Maren Cosens et.al. | 2409.09020 | null |
2024-09-13 | Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation | Qingwen Bu et.al. | 2409.09016 | link |
2024-09-13 | Diffusion crossover from/to |
Antonio Rodríguez et.al. | 2409.08992 | null |
2024-09-13 | User Identity Linkage on Social Networks: A Review of Modern Techniques and Applications | Caterina Senette et.al. | 2409.08966 | null |
2024-09-13 | A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis | Yohan Poirier-Ginter et.al. | 2409.08947 | null |
2024-09-13 | Neural network Approximations for Reaction-Diffusion Equations -- Homogeneous Neumann Boundary Conditions and Long-time Integrations | Eddel Elí Ojeda Avilés et.al. | 2409.08941 | null |
2024-09-13 | Latent Space Score-based Diffusion Model for Probabilistic Multivariate Time Series Imputation | Guojun Liang et.al. | 2409.08917 | link |
2024-09-13 | Tracing the impacts of Mount Pinatubo eruption on global climate using spatially-varying changepoint detection | Samantha Shi-Jun et.al. | 2409.08908 | null |
2024-09-13 | Gaussian is All You Need: A Unified Framework for Solving Inverse Problems via Diffusion Posterior Sampling | Nebiyou Yismaw et.al. | 2409.08906 | null |
2024-09-13 | Quantitative propagation of chaos for non-exchangeable diffusions via first-passage percolation | Daniel Lacker et.al. | 2409.08882 | null |
2024-09-12 | DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors | Thomas Hanwen Zhu et.al. | 2409.08278 | null |
2024-09-12 | Click2Mask: Local Editing with Dynamic Mask Generation | Omer Regev et.al. | 2409.08272 | null |
2024-09-12 | DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer | Runjia Li et.al. | 2409.08271 | null |
2024-09-12 | Touch2Touch: Cross-Modal Tactile Generation for Object Manipulation | Samanta Rodriguez et.al. | 2409.08269 | null |
2024-09-12 | Improving Text-guided Object Inpainting with Semantic Pre-inpainting | Yifu Chen et.al. | 2409.08260 | link |
2024-09-12 | Improving Virtual Try-On with Garment-focused Diffusion Models | Siqi Wan et.al. | 2409.08258 | link |
2024-09-12 | LoRID: Low-Rank Iterative Diffusion for Adversarial Purification | Geigh Zollicoffer et.al. | 2409.08255 | null |
2024-09-12 | Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding | Hongyu Li et.al. | 2409.08251 | null |
2024-09-12 | IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation | Yinwei Wu et.al. | 2409.08240 | null |
2024-09-12 | Structural and electronic transformations in TiO2 induced by electric current | Tyler C. Sterlinga et.al. | 2409.08223 | null |
2024-09-11 | DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation | Haibo Yang et.al. | 2409.07454 | null |
2024-09-11 | Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models | Haibo Yang et.al. | 2409.07452 | link |
2024-09-11 | FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process | Yang Luo et.al. | 2409.07451 | null |
2024-09-11 | StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos | Sijie Zhao et.al. | 2409.07447 | null |
2024-09-11 | Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering | Dafei Qin et.al. | 2409.07441 | null |
2024-09-11 | Dirichlet metric measure spaces: spectrum, irreducibility, and small deviations | Marco Carfagnini et.al. | 2409.07425 | null |
2024-09-11 | Efficient One-Step Diffusion Refinement for Snapshot Compressive Imaging | Yunzhen Wang et.al. | 2409.07417 | null |
2024-09-11 | PRIME: Phase Reversed Interleaved Multi-Echo acquisition enables highly accelerated distortion-free diffusion MRI | Yohan Jun et.al. | 2409.07375 | null |
2024-09-11 | Finite element approximation of stationary Fokker--Planck--Kolmogorov equations with application to periodic numerical homogenization | Timo Sprekeler et.al. | 2409.07371 | null |
2024-09-11 | Training-Free Guidance for Discrete Diffusion Models for Molecular Generation | Thomas J. Kerby et.al. | 2409.07359 | null |
2024-09-10 | SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation | Teng Hu et.al. | 2409.06633 | null |
2024-09-10 | A surprising regularizing effect of the nonlinear semigroup associated to the semilinear heat equation and applications to reaction diffusion systems | Said Kouachi et.al. | 2409.06606 | null |
2024-09-10 | Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models | Xin Jing et.al. | 2409.06451 | null |
2024-09-10 | Spectral Map for Slow Collective Variables, Markovian Dynamics, and Transition State Ensembles | Jakub Rydzewski et.al. | 2409.06428 | null |
2024-09-10 | The Potential of Geminate Pairs in Lead Halide Perovskite revealed via Time-resolved Photoluminescence | Hannes Hempel et.al. | 2409.06382 | null |
2024-09-10 | Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition | Junzheng Zhang et.al. | 2409.06371 | null |
2024-09-10 | Hydrodynamic model for laser swelling | Nikita Bityurin et.al. | 2409.06370 | null |
2024-09-10 | What happens to diffusion model likelihood when your model is conditional? | Mattias Cross et.al. | 2409.06364 | null |
2024-09-10 | DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement | Jia-Wei Liao et.al. | 2409.06355 | null |
2024-09-10 | Market Reaction to News Flows in Supply Chain Networks | Hiroyasu Inoue et.al. | 2409.06255 | null |
2024-09-09 | Quantum maximum entropy closure for small flavor coherence | Julien Froustey et.al. | 2409.05807 | null |
2024-09-09 | Enhancing Preference-based Linear Bandits via Human Response Time | Shen Li et.al. | 2409.05798 | null |
2024-09-09 | Vector Quantized Diffusion Model Based Speech Bandwidth Extension | Yuan Fang et.al. | 2409.05784 | null |
2024-09-09 | Effects of Interfacial Oxygen Diffusion on the Magnetic Properties and Thermal Stability of Pd/CoFeB/Pd/Ta Heterostructure | Saravanan Lakshmanan et.al. | 2409.05783 | null |
2024-09-09 | AS-Speech: Adaptive Style For Speech Synthesis | Zhipeng Li et.al. | 2409.05730 | null |
2024-09-09 | Thermalization And Convergence To Equilibrium Of The Noisy Voter Model | Enzo Aljovin et.al. | 2409.05722 | null |
2024-09-09 | pFedGPA: Diffusion-based Generative Parameter Aggregation for Personalized Federated Learning | Jiahao Lai et.al. | 2409.05701 | null |
2024-09-09 | LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow | Hongyu Wen et.al. | 2409.05688 | null |
2024-09-09 | Unlearning or Concealment? A Critical Analysis and Evaluation Metrics for Unlearning in Diffusion Models | Aakash Sen Sharma et.al. | 2409.05668 | null |
2024-09-09 | Forward KL Regularized Preference Optimization for Aligning Diffusion Policies | Zhao Shan et.al. | 2409.05622 | null |
2024-09-06 | VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation | Yecheng Wu et.al. | 2409.04429 | link |
2024-09-06 | Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques | Davide Clode da Silva et.al. | 2409.04424 | null |
2024-09-06 | Empirical Bayesian image restoration by Langevin sampling with a denoising diffusion implicit prior | Charlesquin Kemajou Mbakam et.al. | 2409.04384 | null |
2024-09-06 | Exploring nuclear structure with multiparticle azimuthal correlations at the LHC | ALICE Collaboration et.al. | 2409.04343 | null |
2024-09-06 | How Fair is Your Diffusion Recommender Model? | Daniele Malitesta et.al. | 2409.04339 | null |
2024-09-06 | Random effects estimation in a fractional diffusion model based on continuous observations | Nesrine Chebli et.al. | 2409.04331 | null |
2024-09-06 | Time-dependent dynamics in the confined lattice Lorentz gas | A. Squarcini et.al. | 2409.04293 | null |
2024-09-06 | Dimensional crossover via confinement in the lattice Lorentz gas | A. Squarcini et.al. | 2409.04289 | null |
2024-09-06 | Electromagnetic field assisted exciton diffusion in moiré superlattices | A. M. Shentsev et.al. | 2409.04284 | null |
2024-09-06 | Breaking the Brownian Barrier: Models and Manifestations of Molecular Diffusion in Complex Fluids | Harish Srinivasan et.al. | 2409.04199 | null |
2024-09-05 | Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding | Yunze Man et.al. | 2409.03757 | link |
2024-09-05 | DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation | Wenliang Zhao et.al. | 2409.03755 | link |
2024-09-05 | ArtiFade: Learning to Generate High-quality Subject from Blemished Images | Shuya Yang et.al. | 2409.03745 | null |
2024-09-05 | Geometry Image Diffusion: Fast and Data-Efficient Text-to-3D with Image-Based Surface Representation | Slava Elizarov et.al. | 2409.03718 | null |
2024-09-05 | RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images | Benzhi Wang et.al. | 2409.03644 | link |
2024-09-05 | DiffEVC: Any-to-Any Emotion Voice Conversion with Expressive Guidance | Hsing-Hang Chou et.al. | 2409.03636 | null |
2024-09-05 | Critical transition between intensive and extensive active droplets | Jonathan Bauermann et.al. | 2409.03629 | null |
2024-09-05 | TCDiff: Triple Condition Diffusion Model with 3D Constraints for Stylizing Synthetic Faces | Bernardo Biesseck et.al. | 2409.03600 | link |
2024-09-05 | Multimodal Laryngoscopic Video Analysis for Assisted Diagnosis of Vocal Cord Paralysis | Yucong Zhang et.al. | 2409.03597 | null |
2024-09-05 | Curvature dependent dynamics of a bacterium confined in a giant unilamellar vesicle | Olivia Vincent et.al. | 2409.03578 | null |
2024-09-04 | HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts | Xinyu Liu et.al. | 2409.02919 | link |
2024-09-04 | Gravitational radiation from binary systems in Unimodular gravity | Indranil Chakraborty et.al. | 2409.02909 | null |
2024-09-04 | Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling | Kaiwen Zheng et.al. | 2409.02908 | null |
2024-09-04 | Toward 2D Dynamo Models Calibrated by Global 3D Relativistic Accretion Disk Simulations | Matthew D. Duez et.al. | 2409.02899 | null |
2024-09-04 | The Impact of Balancing Real and Synthetic Data on Accuracy and Fairness in Face Recognition | Andrea Atzori et.al. | 2409.02867 | null |
2024-09-04 | Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models | Zhibin Liu et.al. | 2409.02851 | link |
2024-09-04 | Multi-Track MusicLDM: Towards Versatile Music Generation with Latent Diffusion Model | Tornike Karchkhadze et.al. | 2409.02845 | null |
2024-09-04 | Segregation in binary mixture with differential contraction among active rings | Emanuel F. Teixeira et.al. | 2409.02814 | null |
2024-09-04 | Microstructural features and hydrogen diffusion in bcc FeCr alloys: a comparison between the Kelvin probe- and nanohardness based- methods | Jing Rao et.al. | 2409.02787 | null |
2024-09-04 | Oxygen Isotope Exchange Between Dust Aggregates and Ambient Nebular Gas | Sota Arakawa et.al. | 2409.02736 | null |
2024-08-30 | CinePreGen: Camera Controllable Video Previsualization via Engine-powered Diffusion | Yiran Chen et.al. | 2408.17424 | null |
2024-08-30 | High-order finite element methods for three-dimensional multicomponent convection-diffusion | Aaron Baier-Reinio et.al. | 2408.17390 | link |
2024-08-30 | An enhanced version of the Gaia map of the brightness of the natural sky | Eduard Masana et.al. | 2408.17371 | null |
2024-08-30 | Dimensional confinement and superdiffusive rotational motion of uniaxial colloids in the presence of cylindrical obstacles | Vikki Anand Varma et.al. | 2408.17345 | null |
2024-08-30 | Subspace Diffusion Posterior Sampling for Travel-Time Tomography | Xiang Cao et.al. | 2408.17333 | null |
2024-08-30 | Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution | Yixin Wu et.al. | 2408.17285 | null |
2024-08-30 | Likelihood estimation for stochastic differential equations with mixed effects | Fernando Baltazar-Larios et.al. | 2408.17257 | null |
2024-08-30 | A kinetic chemotaxis model and its diffusion limit in slab geometry | Herbert Egger et.al. | 2408.17243 | null |
2024-08-30 | Shock-driven amorphization and melt in Fe $_2$O$_3$ | Céline Crépisson et.al. | 2408.17204 | null |
2024-08-30 | Excitation and spatial study of a prestellar cluster towards G+0.693-0.027 in the Galactic centre | L. Colzi et.al. | 2408.17141 | null |
2024-08-29 | ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model | Fangfu Liu et.al. | 2408.16767 | null |
2024-08-29 | CSGO: Content-Style Composition in Text-to-Image Generation | Peng Xing et.al. | 2408.16766 | null |
2024-08-29 | A Score-Based Density Formula, with Applications in Diffusion Generative Models | Gen Li et.al. | 2408.16765 | null |
2024-08-29 | UV-free Texture Generation with Denoising and Geodesic Heat Diffusions | Simone Foti et.al. | 2408.16762 | link |
2024-08-29 | Non-detection of Neutrinos from the BOAT: Improved Constraints on the Parameters of GRB 221009A | P. Veres et.al. | 2408.16748 | null |
2024-08-29 | A VLA Study of Newly-Discovered Southern Latitude Non-Thermal Filaments in the Galactic Center: Polarimetric and Magnetic Field Properties | Dylan M. Pare et.al. | 2408.16745 | null |
2024-08-29 | Porous medium type reaction-diffusion equation: large time behaviors and regularity of free boundary | Qingyou He et.al. | 2408.16718 | null |
2024-08-29 | Hydrogen reaction rate modeling based on convolutional neural network for large eddy simulation | Quentin Malé et.al. | 2408.16709 | null |
2024-08-29 | One-Shot Learning Meets Depth Diffusion in Multi-Object Videos | Anisha Jain et.al. | 2408.16704 | null |
2024-08-29 | DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving | Yongjie Fu et.al. | 2408.16647 | null |
2024-08-28 | TEDRA: Text-based Editing of Dynamic and Photoreal Actors | Basavaraj Sunagad et.al. | 2408.15995 | null |
2024-08-28 | Distribution Backtracking Builds A Faster Convergence Trajectory for One-step Diffusion Distillation | Shengyuan Zhang et.al. | 2408.15991 | link |
2024-08-28 | Direct measurement of surface interactions experienced by sticky microcapsules made from environmentally benign materials | Hairou Yu et.al. | 2408.15945 | null |
2024-08-28 | DiffAge3D: Diffusion-based 3D-aware Face Aging | Junaid Wahid et.al. | 2408.15922 | null |
2024-08-28 | Gen-Swarms: Adapting Deep Generative Models to Swarms of Drones | Carlos Plou et.al. | 2408.15899 | null |
2024-08-28 | Airfoil Diffusion: Denoising Diffusion Model For Conditional Airfoil Generation | Reid Graves et.al. | 2408.15898 | link |
2024-08-28 | Disentangled Diffusion Autoencoder for Harmonization of Multi-site Neuroimaging Data | Ayodeji Ijishakin et.al. | 2408.15890 | null |
2024-08-28 | GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model | Yongjie Fu et.al. | 2408.15868 | null |
2024-08-28 | Global well-posedness and large time behavior of solutions to the compressible Oldroyd-B model without stress diffusion | Yajuan Zhao et.al. | 2408.15812 | null |
2024-08-28 | Defending Text-to-image Diffusion Models: Surprising Efficacy of Textual Perturbations Against Backdoor Attacks | Oscar Chew et.al. | 2408.15721 | null |
2024-08-27 | GenRec: Unifying Video Generation and Recognition with Diffusion Models | Zejia Weng et.al. | 2408.15241 | link |
2024-08-27 | Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation | Xiaojuan Wang et.al. | 2408.15239 | null |
2024-08-27 | Histo-Diffusion: A Diffusion Super-Resolution Method for Digital Pathology with Comprehensive Quality Assessment | Xuan Xu et.al. | 2408.15218 | null |
2024-08-27 | On latent dynamics learning in nonlinear reduced order modeling | Nicola Farenga et.al. | 2408.15183 | null |
2024-08-27 | Simulation of Stochastic Discrete Dislocation Dynamics in Ductile Vs Brittle Materials | Santosh Chhetri et.al. | 2408.15157 | null |
2024-08-27 | Linearization of finite-strain poro-visco-elasticity with degenerate mobility | Willem J. M. van Oosterhout et.al. | 2408.15151 | null |
2024-08-27 | DIFR3CT: Latent Diffusion for Probabilistic 3D CT Reconstruction from Few Planar X-Rays | Yiran Sun et.al. | 2408.15118 | link |
2024-08-27 | Constrained Diffusion Models via Dual Training | Shervin Khalafi et.al. | 2408.15094 | null |
2024-08-27 | Conditioning the logistic continuous-state branching process on non-extinction via its total progeny | Clément Foucart et.al. | 2408.14993 | null |
2024-08-27 | LN-Gen: Rectal Lymph Nodes Generation via Anatomical Features | Weidong Guo et.al. | 2408.14977 | null |
2024-08-26 | Relativistic spin hydrodynamics with momentum and spin-dependent relaxation time | Samapan Bhadury et.al. | 2408.14462 | null |
2024-08-26 | An optimization-based coupling of reduced order models with efficient reduced adjoint basis generation approach | Elizabeth Hawkins et.al. | 2408.14450 | null |
2024-08-27 | DualSpeech: Enhancing Speaker-Fidelity and Text-Intelligibility Through Dual Classifier-Free Guidance | Jinhyeok Yang et.al. | 2408.14423 | null |
2024-08-26 | Consistent diffusion matrix estimation from population time series | Aden Forrow et.al. | 2408.14408 | link |
2024-08-26 | Application of Neural Ordinary Differential Equations for ITER Burning Plasma Dynamics | Zefang Liu et.al. | 2408.14404 | link |
2024-08-26 | GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal Conditioned Policy | Peiyan Li et.al. | 2408.14368 | link |
2024-08-26 | From irregular to regular eutectic growth in the Al-Al3Ni system: in situ observations during directional solidification | Paul Chao et.al. | 2408.14346 | null |
2024-08-26 | On the origin of the |
Mukesh Singh Bisht et.al. | 2408.14344 | null |
2024-08-27 | Foundation Models for Music: A Survey | Yinghao Ma et.al. | 2408.14340 | link |
2024-08-26 | Streamline tractography of the fetal brain in utero with machine learning | Weide Liu et.al. | 2408.14326 | link |
2024-08-23 | How Diffusion Models Learn to Factorize and Compose | Qiyao Liang et.al. | 2408.13256 | null |
2024-08-23 | LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation | Shuai Yang et.al. | 2408.13252 | null |
2024-08-23 | CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities | Tao Wu et.al. | 2408.13239 | link |
2024-08-23 | IFH: a Diffusion Framework for Flexible Design of Graph Generative Models | Samuel Cognolato et.al. | 2408.13194 | link |
2024-08-23 | Optimal order time discretizations for stochastic semilinear wave equations with multiplicative noise | Xiaobing Feng et.al. | 2408.13134 | null |
2024-08-23 | Diffusion-based Episodes Augmentation for Offline Multi-Agent Reinforcement Learning | Jihwan Oh et.al. | 2408.13092 | null |
2024-08-23 | Turbulent convection in emulsions: the Rayleigh-Bénard configuration | Abbas Moradi Bilondi et.al. | 2408.13087 | null |
2024-08-23 | General Intelligent Imaging and Uncertainty Quantification by Deterministic Diffusion Model | Weiru Fan et.al. | 2408.13061 | null |
2024-08-23 | Atlas Gaussians Diffusion for 3D Generation with Infinite Number of Points | Haitao Yang et.al. | 2408.13055 | null |
2024-08-23 | Adaptive complexity of log-concave sampling | Huanjian Zhou et.al. | 2408.13045 | null |
2024-08-22 | Very Extended Ionized Gas Discovered around NGC 1068 with the Circumgalactic H |
Nicole Melso et.al. | 2408.12597 | null |
2024-08-22 | xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations | Can Qin et.al. | 2408.12590 | null |
2024-08-22 | Real-Time Video Generation with Pyramid Attention Broadcast | Xuanlei Zhao et.al. | 2408.12588 | link |
2024-08-22 | ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation | Lujia Zhong et.al. | 2408.12561 | link |
2024-08-23 | Detecting random bifurcations via rigorous enclosures of large deviations rate functions | Alexandra Blessing et.al. | 2408.12556 | null |
2024-08-22 | Neural Fields and Noise-Induced Patterns in Neurons on Large Disordered Networks | Daniele Avitabile et.al. | 2408.12540 | null |
2024-08-22 | Spectral eigenfunction decomposition of a Fokker-Planck operator for relativistic heavy-ion collisions | A. Rizzi et.al. | 2408.12532 | null |
2024-08-22 | Show-o: One Single Transformer to Unify Multimodal Understanding and Generation | Jinheng Xie et.al. | 2408.12528 | null |
2024-08-22 | FlexEdit: Marrying Free-Shape Masks to VLLM for Flexible Image Editing | Jue Wang et.al. | 2408.12429 | link |
2024-08-22 | 4D Diffusion for Dynamic Protein Structure Prediction with Reference Guided Motion Alignment | Kaihui Cheng et.al. | 2408.12419 | null |
2024-08-21 | Pixel Is Not A Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models | Chun-Yen Shih et.al. | 2408.11810 | null |
2024-08-21 | Timeline and Boundary Guided Diffusion Network for Video Shadow Detection | Haipeng Zhou et.al. | 2408.11785 | link |
2024-08-21 | Do We Really Need to Drop Items with Missing Modalities in Multimodal Recommendation? | Daniele Malitesta et.al. | 2408.11767 | link |
2024-08-21 | Modeling multiband SEDs and light curves of BL Lacertae using a time-dependent shock-in-jet model | Rukaiya Khatoon et.al. | 2408.11763 | null |
2024-08-21 | JieHua Paintings Style Feature Extracting Model using Stable Diffusion with ControlNet | Yujia Gu et.al. | 2408.11744 | null |
2024-08-21 | Iterative Object Count Optimization for Text-to-image Diffusion Models | Oz Zafar et.al. | 2408.11721 | null |
2024-08-21 | FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting | Liyao Jiang et.al. | 2408.11706 | null |
2024-08-21 | The off-equilibrium Kinetic Ising model: The Metric Case | Luca Di Carlo et.al. | 2408.11690 | null |
2024-08-21 | Plug-in estimation of Schrödinger bridges | Aram-Alexandre Pooladian et.al. | 2408.11686 | link |
2024-08-21 | Reconstruction of reverberation theory in a diffuse sound field by using reflection orders | Toshiki Hanyu et.al. | 2408.11670 | null |
2024-08-20 | Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model | Chunting Zhou et.al. | 2408.11039 | null |
2024-08-20 | A new perspective on the learning dynamics for a class of learning problems via averaged gradient systems coupled with diffusion-transmutation processes | Getachew K. Befekadu et.al. | 2408.11005 | null |
2024-08-20 | MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning | Haoning Wu et.al. | 2408.11001 | link |
2024-08-20 | Denoising Plane Wave Ultrasound Images Using Diffusion Probabilistic Models | Hojat Asgariandehkordi et.al. | 2408.10987 | null |
2024-08-20 | GreediRIS: Scalable Influence Maximization using Distributed Streaming Maximum Cover | Reet Barik et.al. | 2408.10982 | null |
2024-08-21 | Monte Carlo Physics-informed neural networks for multiscale heat conduction via phonon Boltzmann transport equation | Qingyi Lin et.al. | 2408.10965 | null |
2024-08-20 | Kilometer-Scale Convection Allowing Model Emulation using Generative Diffusion Modeling | Jaideep Pathak et.al. | 2408.10958 | null |
2024-08-20 | Large Point-to-Gaussian Model for Image-to-3D Generation | Longfei Lu et.al. | 2408.10935 | null |
2024-08-20 | A Grey-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse | Zhongliang Guo et.al. | 2408.10901 | null |
2024-08-20 | Radio U-Net: a convolutional neural network to detect diffuse radio sources in galaxy clusters and beyond | Chiara Stuardi et.al. | 2408.10871 | null |
2024-08-19 | MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model | Minghua Liu et.al. | 2408.10198 | null |
2024-08-19 | SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views | Chao Xu et.al. | 2408.10195 | null |
2024-08-19 | Evaluation of the eddy diffusivity in a pollutant dispersion model in the planetary boundary layer | A. Goulart et.al. | 2408.10168 | null |
2024-08-19 | Stacking Polymorphism of PtSe |
Jeonghwan Ahn et.al. | 2408.10156 | null |
2024-08-19 | Solution landscape of reaction-diffusion systems reveals a nonlinear mechanism and spatial robustness of pattern formation | Shuonan Wu et.al. | 2408.10095 | null |
2024-08-19 | Ocean Circulation on Tide-locked Lava Worlds, Part II: Scalings | Yanhong Lai et.al. | 2408.09985 | null |
2024-08-19 | Multi-layer diffusion model of photovoltaic installations | Tomasz Weron et.al. | 2408.09904 | null |
2024-08-19 | Instruction-Based Molecular Graph Generation with Unified Text-Graph Diffusion Model | Yuran Xiang et.al. | 2408.09896 | link |
2024-08-19 | Transport coefficients of the heavy quark in the domain of the non-perturbative and non-eikonal gluon radiation | Surasree Mazumder et.al. | 2408.09824 | null |
2024-08-19 | SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models | Danush Kumar Venkatesh et.al. | 2408.09822 | link |
2024-08-16 | Homogenization of Poisson-Nernst-Planck equations for multiple species in a porous medium | Apratim Bhattacharya et.al. | 2408.08831 | null |
2024-08-16 | PFDiff: Training-free Acceleration of Diffusion Models through the Gradient Guidance of Past and Future | Guangyi Wang et.al. | 2408.08822 | null |
2024-08-16 | Accurate wave velocity measurement from diffuse wave fields | Melody Png et.al. | 2408.08756 | null |
2024-08-16 | Comparative Analysis of Generative Models: Enhancing Image Synthesis with VAEs, GANs, and Stable Diffusion | Sanchayan Vivekananthan et.al. | 2408.08751 | null |
2024-08-16 | Photoluminescence decay of mobile carriers influenced by imperfect quenching at particle surfaces with subdiffusive spread | Ryuzi Katoh et.al. | 2408.08692 | null |
2024-08-16 | An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation | Peiming Guo et.al. | 2408.08650 | null |
2024-08-16 | Modeling the Neonatal Brain Development Using Implicit Neural Representations | Florentin Bieder et.al. | 2408.08647 | link |
2024-08-16 | Sampling effects on Lasso estimation of drift functions in high-dimensional diffusion processes | Chiara Amorino et.al. | 2408.08638 | null |
2024-08-16 | Reference-free Axial Super-resolution of 3D Microscopy Images using Implicit Neural Representation with a 2D Diffusion Prior | Kyungryun Lee et.al. | 2408.08616 | link |
2024-08-16 | Generative Dataset Distillation Based on Diffusion Model | Duo Su et.al. | 2408.08610 | link |
2024-08-15 | Understanding the Local Geometry of Generative Model Manifolds | Ahmed Imtiaz Humayun et.al. | 2408.08307 | null |
2024-08-15 | Accelerated Image-Aware Generative Diffusion Modeling | Tanmay Asthana et.al. | 2408.08306 | null |
2024-08-15 | Solutions and stochastic averaging for delay-path-dependent stochastic variational inequalities in infinite dimensions | Ning Ning et.al. | 2408.08277 | null |
2024-08-15 | Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding | Xiner Li et.al. | 2408.08252 | link |
2024-08-15 | Probing hydrodynamic crossovers with dissipation-assisted operator evolution | N. S. Srivatsa et.al. | 2408.08249 | null |
2024-08-15 | Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion | Adi Haviv et.al. | 2408.08184 | null |
2024-08-15 | Study of non-diffusive thermal behaviors in nanoscale transistors under different heating strategies | Chuang Zhang et.al. | 2408.08120 | null |
2024-08-15 | Exploring Uncertainty Visualization for Degenerate Tensors in 3D Symmetric Second-Order Tensor Field Ensembles | Tadea Schmitz et.al. | 2408.08099 | link |
2024-08-15 | Transport and mixing in control volumes through the lens of probability | John Craske et.al. | 2408.08028 | null |
2024-08-15 | Gravitational Lensing Reveals Cool Gas within 10-20 kpc around a Quiescent Galaxy | Tania M. Barone et.al. | 2408.07984 | null |
2024-08-14 | Drug Discovery SMILES-to-Pharmacokinetics Diffusion Models with Deep Molecular Understanding | Bing Hu et.al. | 2408.07636 | null |
2024-08-14 | Anisotropic Diffusion Model of Communication in 2D Biofilm | Yanahan Paramalingam et.al. | 2408.07626 | null |
2024-08-14 | PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation | Sang-Hoon Lee et.al. | 2408.07547 | link |
2024-08-14 | DifuzCam: Replacing Camera Lens with a Mask and a Diffusion Model | Erez Yosef et.al. | 2408.07541 | null |
2024-08-15 | DIffSteISR: Harnessing Diffusion Prior for Superior Real-world Stereo Image Super-Resolution | Yuanbo Zhou et.al. | 2408.07516 | null |
2024-08-14 | Front propagation in hybrid reaction-diffusion epidemic models with spatial heterogeneity | Quentin Griette et.al. | 2408.07501 | null |
2024-08-14 | DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency | Xiaojing Zhong et.al. | 2408.07481 | null |
2024-08-14 | One Step Diffusion-based Super-Resolution with Time-Aware Distillation | Xiao He et.al. | 2408.07476 | link |
2024-08-14 | Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models | Jean-Marie Lemercier et.al. | 2408.07472 | null |
2024-08-14 | Diffuse Interface Model for Two-Phase Flows on Evolving Surfaces with Different Densities: Global Well-Posedness | Helmut Abels et.al. | 2408.07449 | null |
2024-08-13 | Imagen 3 | Imagen-Team-Google et.al. | 2408.07009 | null |
2024-08-13 | Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models | Cheng Chen et.al. | 2408.06995 | null |
2024-08-13 | DCMSA: Multi-Head Self-Attention Mechanism Based on Deformable Convolution For Seismic Data Denoising | Wang Mingwei et.al. | 2408.06963 | null |
2024-08-13 | Spherical-oblate shape coexistence in |
N. Marchini et.al. | 2408.06940 | null |
2024-08-13 | Global well-posedness of the 3D primitive equations with horizontal viscosity and vertical diffusivity II: close to |
Chongsheng Cao et.al. | 2408.06932 | null |
2024-08-13 | Diffusion Model for Slate Recommendation | Federico Tomasi et.al. | 2408.06883 | null |
2024-08-13 | Dwellers in the Deep: Biological Consequences of Dark Oxygen | Manasvi Lingam et.al. | 2408.06841 | null |
2024-08-13 | Geotree of Geodetector: An Anatomy of Knowledge Diffusion of a Novel Statistic | Yuting Liang et.al. | 2408.06839 | null |
2024-08-13 | Extreme events in locally coupled bursting neurons | Ardhanareeswaran R Sree et.al. | 2408.06805 | null |
2024-08-13 | DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion | Yujia Wu et.al. | 2408.06740 | null |
2024-08-12 | The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery | Chris Lu et.al. | 2408.06292 | link |
2024-08-12 | 3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs) | Jaydeep Rade et.al. | 2408.06244 | null |
2024-08-12 | Singular limit and convergence rate via projection method in a model for plant-growth dynamics with autotoxicity | Jeff Morgan et.al. | 2408.06177 | null |
2024-08-12 | Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance | Taewon Kang et.al. | 2408.06157 | null |
2024-08-12 | Efficient and Scalable Point Cloud Generation with Sparse Point-Voxel Diffusion Models | Ioannis Romanelis et.al. | 2408.06145 | link |
2024-08-12 | CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer | Zhuoyi Yang et.al. | 2408.06072 | link |
2024-08-12 | ControlNeXt: Powerful and Efficient Control for Image and Video Generation | Bohao Peng et.al. | 2408.06070 | link |
2024-08-12 | BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training | Xuanpu Zhang et.al. | 2408.06047 | link |
2024-08-12 | Gradient flow for a class of diffusion equations with Dirichlet boundary data | Matthias Erbar et.al. | 2408.05987 | null |
2024-08-12 | Diffuse-UDA: Addressing Unsupervised Domain Adaptation in Medical Image Segmentation with Appearance and Structure Aligned Diffusion Models | Haifan Gong et.al. | 2408.05985 | null |
2024-08-09 | Multi-Garment Customized Model Generation | Yichen Liu et.al. | 2408.05206 | null |
2024-08-09 | Galactic Gas Models Strongly Affect the Determination of the Diffusive Halo Height | Pedro De La Torre Luque et.al. | 2408.05179 | null |
2024-08-09 | Hidden curved spaces in Bosonic Kitaev model | Chenwei Lv et.al. | 2408.05132 | null |
2024-08-09 | Multi-dimensional Parameter Space Exploration for Streamline-specific Tractography | Ruben Vink et.al. | 2408.05056 | null |
2024-08-09 | Numerical simulation and analysis of mixing enhancement due to chaotic advection using an adaptive approach for approximating the dilution index | Carla Feistner et.al. | 2408.05055 | null |
2024-08-09 | Optical observations of the Galactic SNR HB9 and H II region G159.2+3.3 | Jiang-Tao Li et.al. | 2408.05016 | null |
2024-08-09 | Nanoroughness induced anti-reflection and haze effects in opaque systems | V. Gareyan et.al. | 2408.05014 | null |
2024-08-09 | DreamCouple: Exploring High Quality Text-to-3D Generation Via Rectified Flow | Hangyu Li et.al. | 2408.05008 | null |
2024-08-09 | Instability of the engineered dark state in two-band fermions under number-conserving dissipative dynamics | A. A. Lyublinskaya et.al. | 2408.04987 | null |
2024-08-09 | Solar poloidal magnetic field generation rate from observations and mean-field dynamos | V. V. Pipin et.al. | 2408.04934 | null |
2024-08-08 | Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics | Ruining Li et.al. | 2408.04631 | null |
2024-08-08 | Regularized Unconstrained Weakly Submodular Maximization | Yanhui Zhu et.al. | 2408.04620 | null |
2024-08-08 | An empirical background modeling tool (TweedleDEE) applied to new Milky Way satellite Leo VI | Chance Hoskinson et.al. | 2408.04611 | link |
2024-08-09 | Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models | Qirui Jiao et.al. | 2408.04594 | link |
2024-08-08 | Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches | Yongzhi Xu et.al. | 2408.04567 | null |
2024-08-08 | Local and global existence for the stochastic Prandtl equation driven by multiplicative noises in two and three dimensions | Ya-Guang Wang et.al. | 2408.04546 | null |
2024-08-09 | Electrical resistivity, thermal conductivity, and viscosity of Fe-H alloys at Earth's core conditions | Cong Liu et.al. | 2408.04521 | null |
2024-08-08 | Diffusive hydrodynamics from long-range correlations | Friedrich Hübner et.al. | 2408.04502 | null |
2024-08-08 | Extrinsic Orbital Hall Effect: Interplay Between Diffusive and Intrinsic Transport | Alessandro Veneri et.al. | 2408.04492 | null |
2024-08-08 | Random Walk Diffusion for Efficient Large-Scale Graph Generation | Tobias Bernecker et.al. | 2408.04461 | null |
2024-08-07 | Dynamical patterns in active-passive particle mixtures with non-reciprocal interactions: Exact hydrodynamic analysis | James Mason et.al. | 2408.03932 | null |
2024-08-07 | Fluctuation of coherences in noisy mesoscopic quantum systems with diffusive transport | Ludwig Hruza et.al. | 2408.03917 | null |
2024-08-07 | Disorder-induced delocalization and reentrance in a Chern-Hopf insulator | Soumya Bera et.al. | 2408.03908 | null |
2024-08-07 | A first-order hyperbolic reformulation of the Cahn-Hilliard equation | Firas Dhaouadi et.al. | 2408.03862 | null |
2024-08-07 | Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model | Guoqing Zhu et.al. | 2408.03748 | link |
2024-08-07 | Flexible Bayesian Last Layer Models Using Implicit Priors and Diffusion Posterior Sampling | Jian Xu et.al. | 2408.03746 | null |
2024-08-07 | Unsupervised Detection of Fetal Brain Anomalies using Denoising Diffusion Models | Markus Ditlev Sjøgren Olsen et.al. | 2408.03654 | null |
2024-08-07 | Superdiffusion of energetic particles at shocks: A Lévy Flight model for acceleration | Sophie Aerdker et.al. | 2408.03638 | null |
2024-08-07 | TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization | Kien T. Pham et.al. | 2408.03637 | null |
2024-08-07 | "The Strength of Weak Ties" Varies Across Viral Channels | Shan Huang et.al. | 2408.03579 | null |
2024-08-06 | MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation | Xiaofeng Mao et.al. | 2408.03312 | null |
2024-08-06 | Dwarf Galaxies in the MATLAS Survey: Hubble Space Telescope Observations of the Globular Cluster Systems of 74 Ultra Diffuse Galaxies | Francine R. Marleau et.al. | 2408.03311 | null |
2024-08-06 | TextIM: Part-aware Interactive Motion Synthesis from Text | Siyuan Fan et.al. | 2408.03302 | null |
2024-08-06 | Photonic Mpemba effect | Stefano Longhi et.al. | 2408.03296 | null |
2024-08-06 | Ultra-thin strain-relieving Si $_{1-x}$Ge$_x$ layers enabling III-V epitaxy on Si | Trevor R. Smith et.al. | 2408.03253 | null |
2024-08-06 | Optimizing Density Functional Theory for Strain-Dependent Magnetic Properties of MnBi $_2$Te$_4$ with Diffusion Monte Carlo | Swarnava Ghosh et.al. | 2408.03248 | null |
2024-08-06 | Image Quality Transfer of Diffusion MRI Guided By High-Resolution Structural MRI | Alp G. Cicimen et.al. | 2408.03216 | null |
2024-08-06 | IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts | Ciara Rowles et.al. | 2408.03209 | null |
2024-08-06 | Phase field simulations of thermal annealing for all-small molecule organic solar cells | Yasin Ameslon et.al. | 2408.03190 | null |
2024-08-06 | Propagation phenomena for a reaction-diffusion-advection model in a heterogeneous environment | Xing Liang et.al. | 2408.03187 | null |
2024-08-05 | Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining | Dongyang Liu et.al. | 2408.02657 | link |
2024-08-05 | Ionic-electronic transistors small signal AC admittance: Theory and experiment | Juan Bisquert et.al. | 2408.02648 | null |
2024-08-05 | Learning the Latent dynamics of Fluid flows from High-Fidelity Numerical Simulations using Parsimonious Diffusion Maps | Alessandro Della Pia et.al. | 2408.02630 | null |
2024-08-05 | A Reverse Non-Equilibrium Molecular Dynamics (RNEMD) Algorithm for Coupled Mass and Heat Transport in Mixtures | Cody R. Drisko et.al. | 2408.02621 | null |
2024-08-05 | LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba | Yunxiang Fu et.al. | 2408.02615 | link |
2024-08-05 | Superradiant emission stimulated by vortex-antivortex pair production in layered superconductors | Ahmad Sheikhzada et.al. | 2408.02610 | null |
2024-08-05 | Rossby wave instability in weakly ionized protoplanetary disks. II. radial B-fields | Can Cui et.al. | 2408.02556 | null |
2024-08-05 | Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models | Tongtong Feng et.al. | 2408.02408 | null |
2024-08-05 | Networks of Pendula with Diffusive Interactions | Riccardo Bonetto et.al. | 2408.02352 | null |
2024-08-05 | Nonlocal particle approximation for linear and fast diffusion equations | José Antonio Carrillo et.al. | 2408.02345 | null |
2024-08-02 | Conditional LoRA Parameter Generation | Xiaolong Jin et.al. | 2408.01415 | null |
2024-08-02 | Harmonized connectome resampling for variance in voxel sizes | Elyssa M. McMaster et.al. | 2408.01351 | null |
2024-08-02 | TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling | Dong Huo et.al. | 2408.01291 | null |
2024-08-02 | A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness | Lutao Jiang et.al. | 2408.01269 | null |
2024-08-02 | CLIP4Sketch: Enhancing Sketch to Mugshot Matching through Dataset Augmentation using Diffusion Models | Kushal Kumar Jain et.al. | 2408.01233 | null |
2024-08-02 | Origin of unexpected weak Gilbert damping in the LSMO/Pt bilayer system | Pritam Das et.al. | 2408.01209 | null |
2024-08-02 | Dipole orientation reveals single-molecule interactions and dynamics on 2D crystals | Wei Guo et.al. | 2408.01207 | null |
2024-08-02 | Sustainable Diffusion-based Incentive Mechanism for Generative AI-driven Digital Twins in Industrial Cyber-Physical Systems | Jinbo Wen et.al. | 2408.01173 | null |
2024-08-02 | Machine learning topological energy braiding of non-Bloch bands | Shuwei Shi et.al. | 2408.01141 | null |
2024-08-02 | Inverse Raman scattering and the diffuse interstellar bands: an exploration of the systemic interconnections between spontaneous and inverse Raman scattering and extended red emission, Red Rectangle bands, and diffuse interstellar bands | Frederic Zagury et.al. | 2408.01103 | null |
2024-08-01 | Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation | Yixiao Wang et.al. | 2408.00766 | null |
2024-08-01 | Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention | Susung Hong et.al. | 2408.00760 | link |
2024-08-01 | TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models | Gilad Deutch et.al. | 2408.00735 | null |
2024-08-01 | ISDE with logarithmic interaction and characteristic polynomials | Theodoros Assiotis et.al. | 2408.00717 | null |
2024-08-01 | MotionFix: Text-Driven 3D Human Motion Editing | Nikos Athanasiou et.al. | 2408.00712 | null |
2024-08-01 | Alpha-VI DeepONet: A prior-robust variational Bayesian approach for enhancing DeepONets with uncertainty quantification | Soban Nasir Lone et.al. | 2408.00681 | null |
2024-08-01 | Evaluation Metrics and Methods for Generative Models in the Wireless PHY Layer | Michael Baur et.al. | 2408.00634 | null |
2024-08-01 | Generalised BBGKY hierarchy for near-integrable dynamics | Leonardo Biagetti et.al. | 2408.00593 | null |
2024-08-01 | Conditional Independence in Stationary Diffusions | Tobias Boege et.al. | 2408.00583 | null |
2024-08-01 | Dimension reduction for large-scale stochastic systems with non-zero initial states and controlled diffusion | Martin Redmann et.al. | 2408.00581 | null |
2024-07-31 | Detecting, Explaining, and Mitigating Memorization in Diffusion Models | Yuxin Wen et.al. | 2407.21720 | link |
2024-07-31 | Dephasing-assisted transport in a tight-binding chain with a linear potential | Samuel L. Jacob et.al. | 2407.21715 | null |
2024-07-31 | Tora: Trajectory-oriented Diffusion Transformer for Video Generation | Zhenghao Zhang et.al. | 2407.21705 | link |
2024-07-31 | Generative Diffusion Model for Seismic Imaging Improvement of Sparsely Acquired Data and Uncertainty Quantification | Xingchen Shi et.al. | 2407.21683 | null |
2024-07-31 | Charged-impurity free printing-based diffusion doping in molybdenum disulfide field-effect transistors | Inho Jeong et.al. | 2407.21678 | null |
2024-07-31 | Stable Perovskite Solar Cells via exfoliated graphite as an ion diffusion-blocking layer | Abdullah S. Alharbi et.al. | 2407.21662 | null |
2024-07-31 | Properties of the diffuse gas component in filaments detected in the Dianoga cosmological simulations | Samo Ilc et.al. | 2407.21636 | null |
2024-07-31 | VEGAS-SSS: Tracing Globular Cluster Populations in the Interacting NGC3640 Galaxy Group | Marco Mirabile et.al. | 2407.21620 | null |
2024-07-31 | A τ Matrix Based Approximate Inverse Preconditioning for Tempered Fractional Diffusion Equations | Xuan Zhang et.al. | 2407.21603 | null |
2024-07-31 | Robust Simultaneous Multislice MRI Reconstruction Using Deep Generative Priors | Shoujin Huang et.al. | 2407.21600 | null |
2024-07-30 | Matting by Generation | Zhixiang Wang et.al. | 2407.21017 | null |
2024-07-30 | Add-SD: Rational Generation without Manual Reference | Lingfeng Yang et.al. | 2407.21016 | link |
2024-07-30 | Analysis of Polarized Dust Emission from the First Flight of the SPIDER Balloon-Borne Telescope | SPIDER Collaboration et.al. | 2407.20982 | null |
2024-07-30 | Explicit solution to an optimal two-player switching game in infinite horizon | Brahim El Asri et.al. | 2407.20913 | null |
2024-07-30 | Simultaneous Multi-Slice Diffusion Imaging using Navigator-free Multishot Spiral Acquisition | Yuancheng Jiang et.al. | 2407.20904 | null |
2024-07-30 | Optimizing Charge Transport Simulation for Deep Learning Enhanced Spatial Resolution of the MÖNCH Detector | X. Xie et.al. | 2407.20841 | null |
2024-07-30 | Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks | Yunfeng Diao et.al. | 2407.20836 | null |
2024-07-30 | A second-order Mean Field Games model with controlled diffusion | Vincenzo Ignazio et.al. | 2407.20826 | null |
2024-07-30 | A discussion on the critical electric Rayleigh number for AC electrokinetic flow of binary fluids in a divergent microchannel | Jinan Pang et.al. | 2407.20803 | null |
2024-07-30 | Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning | Norman Di Palo et.al. | 2407.20798 | null |
2024-07-29 | Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing | Ekaterina Iakovleva et.al. | 2407.20232 | null |
2024-07-30 | cDVAE: Multimodal Generative Conditional Diffusion Guided by Variational Autoencoder Latent Embedding for Virtual 6D Phase Space Diagnostics | Alexander Scheinker et.al. | 2407.20218 | null |
2024-07-29 | On the leptonic contribution to the ultra high-energy diffuse gamma-ray background | Samy Kaci et.al. | 2407.20186 | null |
2024-07-29 | LatentArtiFusion: An Effective and Efficient Histological Artifacts Restoration Framework | Zhenqi He et.al. | 2407.20172 | link |
2024-07-29 | Diffusion Feedback Helps CLIP See Better | Wenxuan Wang et.al. | 2407.20171 | link |
2024-07-29 | A unified framework for |
M. ten Eikelder et.al. | 2407.20145 | null |
2024-07-29 | DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models | Jing Yang et.al. | 2407.20141 | null |
2024-07-29 | Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning | Liyuan Mao et.al. | 2407.20109 | null |
2024-07-29 | Generative Diffusion Model Bootstraps Zero-shot Classification of Fetal Ultrasound Images In Underrepresented African Populations | Fangyijie Wang et.al. | 2407.20072 | link |
2024-07-29 | ImagiNet: A Multi-Content Dataset for Generalizable Synthetic Image Detection via Contrastive Learning | Delyan Boychev et.al. | 2407.20020 | link |
2024-07-26 | SHIC: Shape-Image Correspondences with no Keypoint Supervision | Aleksandar Shtedritski et.al. | 2407.18907 | null |
2024-07-26 | Asymptotic behavior of a diffused interface volume-preserving mean curvature flow | Matteo Bonforte et.al. | 2407.18868 | null |
2024-07-26 | Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment | Yuze Zheng et.al. | 2407.18854 | null |
2024-07-26 | Disentangling competing interactions in disordered materials using interaction space modelling | Ella M. Schmidt et.al. | 2407.18815 | null |
2024-07-26 | Locomotion of Active Polymerlike Worms in Porous Media | Rosa Sinaasappel et.al. | 2407.18805 | null |
2024-07-26 | Log-Concave Coupling for Sampling Neural Net Posteriors | Curtis McDonald et.al. | 2407.18802 | null |
2024-07-26 | Revision of calcium and scandium abundances in Am stars based on NLTE calculations and comparison with diffusion stellar evolution models | L. I. Mashonkina et.al. | 2407.18736 | null |
2024-07-26 | Global dynamics of a two-stage structured diffusive population model in time-periodic and spatially heterogeneous environments | H. M. Gueguezo et.al. | 2407.18669 | null |
2024-07-26 | Adversarial Robustification via Text-to-Image Diffusion Models | Daewon Choi et.al. | 2407.18658 | link |
2024-07-26 | Mean-field control of non exchangeable systems | Anna De Crescenzo et.al. | 2407.18635 | null |
2024-07-25 | RegionDrag: Fast Region-Based Image Editing with Diffusion Models | Jingyi Lu et.al. | 2407.18247 | null |
2024-07-25 | VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads | Orest Kupyn et.al. | 2407.18245 | link |
2024-07-25 | Roger Tribe et.al. | 2407.18212 | null | |
2024-07-25 | Chemically reactive and aging macromolecular mixtures II: Phase separation and coarsening | Ruoyao Zhang et.al. | 2407.18171 | null |
2024-07-25 | Solvability and optimal control of a multi-species Cahn-Hilliard-Keller-Segel tumor growth model | Pierluigi Colli et.al. | 2407.18162 | null |
2024-07-25 | Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images | Roberto Di Via et.al. | 2407.18125 | null |
2024-07-25 | Testing non-local gravity through Ultra-Diffuse Galaxies kinematics | Filippo Bouchè et.al. | 2407.18084 | null |
2024-07-25 | Equation of state of Bose gases beyond the universal regime | Marti Planasdemunt et.al. | 2407.18059 | null |
2024-07-25 | AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild | Junho Park et.al. | 2407.18034 | link |
2024-07-25 | Three-dimensional exponential mixing and ideal kinematic dynamo with randomized ABC flows | Michele Coti Zelati et.al. | 2407.18028 | null |
2024-07-24 | SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency | Yiming Xie et.al. | 2407.17470 | null |
2024-07-24 | Gender disparities in the dissemination and acquisition of scientific knowledge | Chiara Zappalà et.al. | 2407.17441 | null |
2024-07-24 | CDDIP: Constrained Diffusion-Driven Deep Image Prior for Seismic Image Reconstruction | Paul Goyes-Peñafiel et.al. | 2407.17402 | link |
2024-07-24 | Kinetic theory applied to pressure-controlled shear flows of frictionless spheres between rigid, bumpy planes | Dalila Vescovi et.al. | 2407.17397 | null |
2024-07-24 | How optimal control of polar sea-ice depends on its tipping points | Parvathi Kooloth et.al. | 2407.17357 | null |
2024-07-24 | Optimal Control of a Reaction-Diffusion Epidemic Model with Noncompliance | Marcelo Bongarti et.al. | 2407.17298 | null |
2024-07-24 | Hybrid-PFC: coupling the phase-field crystal model and its amplitude-equation formulation | Maik Punke et.al. | 2407.17283 | null |
2024-07-24 | Stochastic Aggregation Diffusion-Equation : Analysis via Dirichlet Forms | Jaouad Bourabiaa et.al. | 2407.17239 | null |
2024-07-25 | LPGen: Enhancing High-Fidelity Landscape Painting Generation through Diffusion Model | Wanggong Yang et.al. | 2407.17229 | null |
2024-07-24 | Sublinear Regret for An Actor-Critic Algorithm in Continuous-Time Linear-Quadratic Reinforcement Learning | Yilie Huang et.al. | 2407.17226 | null |
2024-07-23 | Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions | Fabio Tosi et.al. | 2407.16698 | link |
2024-07-23 | DIISC-V: Variations in H |
Mansi Padave et.al. | 2407.16690 | null |
2024-07-23 | Uncountable Infinite Exact Solutions to the FitzHugh-Nagumo Model | Shahid Sultan Ali Ramji et.al. | 2407.16678 | null |
2024-07-23 | From Imitation to Refinement -- Residual RL for Precise Visual Assembly | Lars Ankile et.al. | 2407.16677 | null |
2024-07-23 | MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence | Canyu Zhao et.al. | 2407.16655 | null |
2024-07-23 | The role of meridional flow in the generation of solar/stellar magnetic fields and cycles | Vindya Vashishth et.al. | 2407.16620 | null |
2024-07-24 | Audio Prompt Adapter: Unleashing Music Editing Abilities for Text-to-Music with Lightweight Finetuning | Fang-Duo Tsai et.al. | 2407.16564 | link |
2024-07-23 | Free boundary limits of coupled bulk-surface models for receptor-ligand interactions on evolving domains | Amal Alphonse et.al. | 2407.16522 | null |
2024-07-23 | DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models | Zhenyu Xie et.al. | 2407.16511 | null |
2024-07-23 | qMRI Diffusor: Quantitative T1 Mapping of the Brain using a Denoising Diffusion Probabilistic Model | Shishuai Wang et.al. | 2407.16477 | null |
2024-07-22 | Artist: Aesthetically Controllable Text-Driven Stylization without Training | Ruixiang Jiang et.al. | 2407.15842 | link |
2024-07-22 | Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget | Vikash Sehwag et.al. | 2407.15811 | null |
2024-07-22 | Diffusion Model Based Resource Allocation Strategy in Ultra-Reliable Wireless Networked Control Systems | Amirhassan Babazadeh Darabi et.al. | 2407.15784 | null |
2024-07-22 | A Hamilton-Jacobi approach to road-field reaction-diffusion models | Christopher Henderson et.al. | 2407.15760 | null |
2024-07-22 | Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond | Silvio Galesso et.al. | 2407.15739 | link |
2024-07-22 | Inverse problems for coupled nonlocal nonlinear systems arising in mathematical biology | Ming-Hui Ding et.al. | 2407.15713 | null |
2024-07-22 | Estimating Probability Densities with Transformer and Denoising Diffusion | Henry W. Leung et.al. | 2407.15703 | link |
2024-07-22 | Voltage mapping in subcellular nanodomains using electro-diffusion modeling | Frédéric Paquin-Lefebvre et.al. | 2407.15697 | null |
2024-07-22 | Particle Based Inference for Continuous-Discrete State Space Models | Christopher Stanton et.al. | 2407.15666 | null |
2024-07-22 | DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving | Jiahang Tu et.al. | 2407.15661 | link |
2024-07-19 | DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks | Sarah Jabbour et.al. | 2407.14509 | null |
2024-07-19 | M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models | Seunggeun Chi et.al. | 2407.14502 | null |
2024-07-19 | Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model | Seonghui Min et.al. | 2407.14434 | null |
2024-07-19 | Controllable and Efficient Multi-Class Pathology Nuclei Data Augmentation using Text-Conditioned Diffusion Models | Hyun-Jic Oh et.al. | 2407.14426 | null |
2024-07-19 | HOTS3D: Hyper-Spherical Optimal Transport for Semantic Alignment of Text-to-3D Generation | Zezeng Li et.al. | 2407.14419 | null |
2024-07-19 | Uniqueness of the inverse source problem for fractional diffusion-wave equations | Lingyun Qiu et.al. | 2407.14413 | null |
2024-07-19 | Natural convection in the cytoplasm: Theoretical predictions of buoyancy-driven flows inside a cell | Nikhil Desai et.al. | 2407.14385 | null |
2024-07-19 | As Generative Models Improve, People Adapt Their Prompts | Eaman Jahani et.al. | 2407.14333 | null |
2024-07-19 | Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model | Kun Zhao et.al. | 2407.14326 | null |
2024-07-19 | Time-dependent condensate formation in ultracold atoms with energy-dependent transport coefficients | M. Larsson et.al. | 2407.14307 | null |
2024-07-18 | Large deviations of Dyson Brownian motion on the circle and multiradial SLE0+ | Osama Abuzaid et.al. | 2407.13762 | null |
2024-07-18 | Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion | Boyang Deng et.al. | 2407.13759 | null |
2024-07-18 | LogoSticker: Inserting Logos into Diffusion Models for Customized Generation | Mingkang Zhu et.al. | 2407.13752 | null |
2024-07-18 | Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review | Masatoshi Uehara et.al. | 2407.13734 | link |
2024-07-18 | MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis | Ziming Zhong et.al. | 2407.13675 | link |
2024-07-18 | Are the surface abundance structures stable in rapidly rotating Ap star 56 Ari? | I. Potravnov et.al. | 2407.13645 | null |
2024-07-18 | Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models | Xiaoyu Zhu et.al. | 2407.13642 | null |
2024-07-18 | Training-free Composite Scene Generation for Layout-to-Image Synthesis | Jiaqi Liu et.al. | 2407.13609 | link |
2024-07-18 | Functional Renormalization Group analysis of the quark-condensation pattern on the Fermi surface: A simple effective-model approach | Kie Sang Jeong et.al. | 2407.13589 | null |
2024-07-18 | The long way of a viscous vortex dipole | Michele Dolce et.al. | 2407.13562 | null |
2024-07-17 | SMooDi: Stylized Motion Diffusion Model | Lei Zhong et.al. | 2407.12783 | null |
2024-07-17 | VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control | Sherwin Bahmani et.al. | 2407.12781 | null |
2024-07-17 | Hallucination Index: An Image Quality Metric for Generative Reconstruction Models | Matthew Tivnan et.al. | 2407.12780 | null |
2024-07-17 | The Role of Network and Identity in the Diffusion of Hashtags | Aparna Ananthasubramaniam et.al. | 2407.12771 | null |
2024-07-17 | Vanishing viscosity limit for hyperbolic system of Temple class in 1-d with nonlinear viscosity | Boris Haspot et.al. | 2407.12766 | null |
2024-07-17 | GroundUp: Rapid Sketch-Based 3D City Massing | Gizem Esra Unlu et.al. | 2407.12739 | null |
2024-07-17 | NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model | Zhongqun Zhang et.al. | 2407.12727 | null |
2024-07-18 | SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow | Yuanzhi Zhu et.al. | 2407.12718 | link |
2024-07-17 | Stein's method and general clocks: diffusion approximation of the |
Anton Braverman et.al. | 2407.12716 | null |
2024-07-17 | IMAGDressing-v1: Customizable Virtual Dressing | Fei Shen et.al. | 2407.12705 | link |
2024-07-16 | Efficient Training with Denoised Neural Weights | Yifan Gong et.al. | 2407.11966 | null |
2024-07-16 | UrbanWorld: An Urban World Model for 3D City Generation | Yu Shang et.al. | 2407.11965 | link |
2024-07-16 | Gated Temporal Diffusion for Stochastic Long-Term Dense Anticipation | Olga Zatsarynna et.al. | 2407.11954 | link |
2024-07-16 | Spatiotemporal dynamics of ionic reorganization near biological membrane interfaces | Hyeongjoo Row et.al. | 2407.11947 | null |
2024-07-16 | Context-Guided Diffusion for Out-of-Distribution Molecular and Protein Design | Leo Klarner et.al. | 2407.11942 | link |
2024-07-16 | Revisiting primordial magnetic fields through 21-cm physics: Bounds and forecasts | Arko Bhaumik et.al. | 2407.11923 | null |
2024-07-16 | Energy dependence of the knee in the cosmic ray spectrum across the Milky Way | C. Prevotat et.al. | 2407.11911 | null |
2024-07-16 | Impact of coherent mode coupling on noise performance in elliptical aperture VCSELs for datacom | Cristina Rimoldi et.al. | 2407.11899 | null |
2024-07-16 | Single Layer Single Gradient Unlearning | Zikui Cai et.al. | 2407.11867 | link |
2024-07-16 | Navigating Munk's Abyssal Recipes: Reconciling the Paradoxes and Suggesting an Upwelling Mechanism for Bottom Water in a Flat-Bottom Ocean | Lei Han et.al. | 2407.11864 | null |
2024-07-15 | Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion | Yongyuan Liang et.al. | 2407.10973 | null |
2024-07-15 | InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models | Nirat Saini et.al. | 2407.10958 | null |
2024-07-15 | IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation | Yuanhao Zhai et.al. | 2407.10937 | link |
2024-07-15 | On the Cyclostationary Linear Inverse Models: A Mathematical Insight and Implication | Justin Lien et.al. | 2407.10931 | null |
2024-07-16 | DataDream: Few-shot Guided Dataset Generation | Jae Myung Kim et.al. | 2407.10910 | link |
2024-07-15 | Optical Diffusion Models for Image Generation | Ilker Oguz et.al. | 2407.10897 | null |
2024-07-15 | R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection | Zheyuan Zhou et.al. | 2407.10862 | null |
2024-07-15 | Physics-Inspired Generative Models in Medical Imaging: A Review | Dennis Hein et.al. | 2407.10856 | null |
2024-07-15 | MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration | Yulin Ren et.al. | 2407.10833 | null |
2024-07-15 | The effective diffusion constant of stochastic processes with spatially periodic noise | Stefano Giordano et.al. | 2407.10813 | null |
2024-07-12 | A Primitive Model for Predicting Membrane Currents in Excitable Cells Based Only on Ion Diffusion Coefficients | Vivaan Patel et.al. | 2407.09474 | null |
2024-07-12 | Weak Chaos, Anomalous Diffusion, and Weak Ergodicity Breaking in Systems with Delay | Tony Albers et.al. | 2407.09449 | null |
2024-07-12 | Intensive broadband reverberation mapping of Fairall 9 with 1.8 years of daily Swift monitoring | R. Edelson et.al. | 2407.09445 | null |
2024-07-12 | Efficient energy-stable parametric finite element methods for surface diffusion flow and applications in solid-state dewetting | Meng Li et.al. | 2407.09418 | null |
2024-07-12 | A Numerical Study of WENO Approximations to Sharp Propagating Fronts for Reaction-Diffusion Systems | Jiaxi Gu et.al. | 2407.09393 | null |
2024-07-12 | Grain boundaries control lithiation of solid solution substrates in lithium metal batteries | Leonardo Shoji Aota et.al. | 2407.09374 | null |
2024-07-12 | Any-Property-Conditional Molecule Generation with Self-Criticism using Spanning Trees | Alexia Jolicoeur-Martineau et.al. | 2407.09357 | link |
2024-07-12 | Thermodynamics of Giant Molecular Clouds: The Effects of Dust Grain Size | Nadine H. Soliman et.al. | 2407.09343 | null |
2024-07-12 | PID: Physics-Informed Diffusion Model for Infrared Image Generation | Fangyuan Mao et.al. | 2407.09299 | link |
2024-07-12 | SS-SfP:Neural Inverse Rendering for Self Supervised Shape from (Mixed) Polarization | Ashish Tiwari et.al. | 2407.09294 | null |
2024-07-11 | Video Diffusion Alignment via Reward Gradients | Mihir Prabhudesai et.al. | 2407.08737 | link |
2024-07-11 | Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models | Zhening Xing et.al. | 2407.08701 | null |
2024-07-11 | CAD-Prompted Generative Models: A Pathway to Feasible and Novel Engineering Designs | Leah Chong et.al. | 2407.08675 | null |
2024-07-11 | Controlling the Fidelity and Diversity of Deep Generative Models via Pseudo Density | Shuangqi Li et.al. | 2407.08659 | null |
2024-07-11 | Fine-Tuning Stable Diffusion XL for Stylistic Icon Generation: A Comparison of Caption Size | Youssef Sultan et.al. | 2407.08513 | null |
2024-07-11 | Latent Conditional Diffusion-based Data Augmentation for Continuous-Time Dynamic Graph Mode | Yuxing Tian et.al. | 2407.08500 | null |
2024-07-11 | Killing versus catastrophes in birth-death processes and an application to population genetics | Ellen Baake et.al. | 2407.08478 | null |
2024-07-11 | Propagation and non-reciprocity in time-modulated diffusion through the lens of high-order homogenization | Marie Touboul et.al. | 2407.08456 | null |
2024-07-11 | A fitted space-time finite element method for an advection-diffusion problem with moving interfaces | Quang Huy Nguyen et.al. | 2407.08439 | null |
2024-07-11 | Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers | Zhengbo Zhang et.al. | 2407.08394 | null |
2024-07-10 | Generative Image as Action Models | Mohit Shridhar et.al. | 2407.07875 | link |
2024-07-10 | Dynamical Measure Transport and Neural PDE Solvers for Sampling | Jingtong Sun et.al. | 2407.07873 | null |
2024-07-10 | Controlling Space and Time with Diffusion Models | Daniel Watson et.al. | 2407.07860 | null |
2024-07-10 | Generic Numerical Analysis of Stochastic Reaction Diffusion Model with applications in excitable media | Yahya Alnashri et.al. | 2407.07834 | null |
2024-07-10 | Dynamical signatures of discontinuous phase transitions: How phase coexistence determines exponential versus power-law scaling | Krzysztof Ptaszynski et.al. | 2407.07832 | null |
2024-07-10 | Universal and non-universal signatures in the scaling functions of critical variables | Gianluca Teza et.al. | 2407.07782 | null |
2024-07-10 | Correlation of srf performance to oxygen diffusion length of medium temperature heat treated cavities | C. Bate et.al. | 2407.07779 | null |
2024-07-10 | Challenges in modeling the dark matter halo of NGC 1052-DF2: Cored versus cuspy halo models | K. Aditya et.al. | 2407.07770 | null |
2024-07-10 | Feasibility Study on Active Learning of Smart Surrogates for Scientific Simulations | Pradeep Bajracharya et.al. | 2407.07674 | null |
2024-07-10 | VEnhancer: Generative Space-Time Enhancement for Video Generation | Jingwen He et.al. | 2407.07667 | null |
2024-07-09 | ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction | Shaozhe Hao et.al. | 2407.07077 | link |
2024-07-09 | Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images | Chuanrui Zhang et.al. | 2407.06984 | null |
2024-07-09 | RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models | Bowen Zhang et.al. | 2407.06938 | null |
2024-07-09 | HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance | Guian Fang et.al. | 2407.06937 | link |
2024-07-09 | Dissipation enhancing properties for a class of Hamiltonian flows with closed streamlines | Michele Dolce et.al. | 2407.06884 | null |
2024-07-09 | Relation between asymptotic |
Nuno J. Alves et.al. | 2407.06830 | null |
2024-07-09 | AstroSpy: On detecting Fake Images in Astronomy via Joint Image-Spectral Representations | Mohammed Talha Alam et.al. | 2407.06817 | null |
2024-07-09 | A reaction-diffusion model for relapsing-remitting multiple sclerosis with a treatment term | Romina Travaglini et.al. | 2407.06802 | null |
2024-07-09 | Investigating the Kinetic Effects on Current Gradient-Driven Instabilities of Electron Current Layers via Particle-in-Cell Simulations | Sushmita Mishra et.al. | 2407.06799 | null |
2024-07-09 | Extinction profiles for the Sobolev critical fast diffusion equation in bounded domains. I. One bubble dynamics | Tianling Jin et.al. | 2407.06757 | null |
2024-07-08 | Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images | Zhangyang Qi et.al. | 2407.06191 | null |
2024-07-08 | JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation | Yu Zeng et.al. | 2407.06187 | null |
2024-07-08 | The Tug-of-War Between Deepfake Generation and Detection | Hannah Lee et.al. | 2407.06174 | null |
2024-07-08 | Potential Based Diffusion Motion Planning | Yunhao Luo et.al. | 2407.06169 | null |
2024-07-08 | Loewner traces driven by Levy processes | Eveliina Peltola et.al. | 2407.06144 | null |
2024-07-08 | ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation | Ethan Chern et.al. | 2407.06135 | link |
2024-07-08 | Structured Generations: Using Hierarchical Clusters to guide Diffusion Models | Jorge da Silva Goncalves et.al. | 2407.06124 | link |
2024-07-08 | PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models | Jinhua Zhang et.al. | 2407.06109 | link |
2024-07-08 | Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation | Xinyu Bai et.al. | 2407.06095 | null |
2024-07-08 | Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis | Emaad Khwaja et.al. | 2407.06079 | null |
2024-07-05 | Two methods to analyse radial diffusion ensembles: the peril of space- and time- dependent diffusion | Sarah N. Bentley et.al. | 2407.04669 | null |
2024-07-05 | Strongly consistent low-dissipation WENO schemes for finite elements | Joshua Vedral et.al. | 2407.04646 | null |
2024-07-05 | Randomized Physics-Informed Neural Networks for Bayesian Data Assimilation | Yifei Zong et.al. | 2407.04617 | link |
2024-07-05 | An SDE Perspective on Stochastic Inertial Gradient Dynamics with Time-Dependent Viscosity and Geometric Damping | Rodrigo Maulen-Soto et.al. | 2407.04562 | null |
2024-07-05 | Structural Constraint Integration in Generative Model for Discovery of Quantum Material Candidates | Ryotaro Okabe et.al. | 2407.04557 | null |
2024-07-05 | More is Different: Mobile Ions Improve the Design Tolerances of Perovskite Solar Cells | Lucy J. F. Hart et.al. | 2407.04523 | null |
2024-07-05 | Unified continuous-time q-learning for mean-field game and mean-field control problems | Xiaoli Wei et.al. | 2407.04521 | null |
2024-07-05 | G-Adaptive mesh refinement -- leveraging graph neural networks and differentiable finite element solvers | James Rowbottom et.al. | 2407.04516 | link |
2024-07-05 | Analysis of SIR Reaction diffusion system with constant birth and death rate | Yiting Yao et.al. | 2407.04509 | null |
2024-07-05 | Speed-accuracy trade-off for the diffusion models: Wisdom from nonequlibrium thermodynamics and optimal transport | Kotaro Ikeda et.al. | 2407.04495 | null |
2024-07-03 | DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents | Yilun Xu et.al. | 2407.03300 | link |
2024-07-03 | Improved Noise Schedule for Diffusion Training | Tiankai Hang et.al. | 2407.03297 | null |
2024-07-03 | LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control | Jianzhu Guo et.al. | 2407.03168 | link |
2024-07-03 | Closing Pandora's Box -- The deepest X-ray observations of Abell 2744 and a multi-wavelength merger picture | Urmila Chadayammuri et.al. | 2407.03142 | link |
2024-07-03 | Chao Zhou et.al. | 2407.03115 | null | |
2024-07-04 | Spatio-Temporal Adaptive Diffusion Models for EEG Super-Resolution in Epilepsy Diagnosis | Tong Zhou et.al. | 2407.03089 | null |
2024-07-03 | Electromagnetic Property Sensing Based on Diffusion Model in ISAC System | Yuhua Jiang et.al. | 2407.03075 | null |
2024-07-03 | Semantic-Aware Power Allocation for Generative Semantic Communications with Foundation Models | Chunmei Xu et.al. | 2407.03050 | null |
2024-07-03 | SlerpFace: Face Template Protection via Spherical Linear Interpolation | Zhizhou Zhong et.al. | 2407.03043 | null |
2024-07-03 | NLP Sampling: Combining MCMC and NLP Methods for Diverse Constrained Sampling | Marc Toussaint et.al. | 2407.03035 | null |
2024-07-02 | Magic Insert: Style-Aware Drag-and-Drop | Nataniel Ruiz et.al. | 2407.02489 | null |
2024-07-02 | Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models | Fei Shen et.al. | 2407.02482 | link |
2024-07-02 | Mirages and Large TeV Halo-Pulsar Offsets from Cosmic Ray Propagation | Yiwei Bao et.al. | 2407.02478 | null |
2024-07-02 | Diffusion and pattern formation in spatial games | Alexandre Champagne-Ruel et.al. | 2407.02385 | link |
2024-07-02 | OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation | Kepan Nan et.al. | 2407.02371 | null |
2024-07-02 | Hypersonic Boundary Layer Transition and Heat Loading | Ahmad Peyvan et.al. | 2407.02311 | null |
2024-07-02 | Turbulent Diffuse Molecular Media with Non-ideal Magnetohydrodynamics and Consistent Thermochemistry: Numerical Simulations and Dynamic Characteristics | Nannan Yue et.al. | 2407.02306 | null |
2024-07-02 | On the multicomponent reactive flows in moving domains | Kuntal Bhandari et.al. | 2407.02303 | null |
2024-07-02 | Solution of parameter-dependent diffusion equation in layered media | Antti Autio et.al. | 2407.02257 | null |
2024-07-02 | GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models | Jian Ma et.al. | 2407.02252 | link |
2024-06-28 | Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language | Yicheng Chen et.al. | 2406.20085 | null |
2024-06-28 | HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model | Hieu T. Nguyen et.al. | 2406.20077 | null |
2024-06-28 | Neural Differentiable Modeling with Diffusion-Based Super-resolution for Two-Dimensional Spatiotemporal Turbulence | Xiantao Fan et.al. | 2406.20047 | null |
2024-06-28 | HAITCH: A Framework for Distortion and Motion Correction in Fetal Multi-Shell Diffusion-Weighted MRI | Haykel Snoussi et.al. | 2406.20042 | null |
2024-06-28 | Kinetics of Quantum Reaction-Diffusion systems | Federico Gerbino et.al. | 2406.20028 | null |
2024-06-28 | Information Entropy of the Financial Market: Modelling Random Processes Using Open Quantum Systems | Will Hicks et.al. | 2406.20027 | null |
2024-06-28 | Learning glass transition temperatures via dimensionality reduction with data from computer simulations: Polymers as the pilot case | Artem Glova et.al. | 2406.20018 | link |
2024-06-28 | On the Trade-off between Flatness and Optimization in Distributed Learning | Ying Cao et.al. | 2406.20006 | null |
2024-06-28 | The |
Leon M. G. de la Vega et.al. | 2406.19968 | null |
2024-06-28 | RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization | Bing Yang et.al. | 2406.19959 | link |
2024-06-27 | Asymptotic Properties of Generalized Elephant Random Walks | Krishanu Maulik et.al. | 2406.19383 | null |
2024-06-27 | Spontaneous symmetry breaking in open quantum systems: strong, weak, and strong-to-weak | Ding Gu et.al. | 2406.19381 | null |
2024-06-27 | Accelerating Multiphase Flow Simulations with Denoising Diffusion Model Driven Initializations | Jaehong Chung et.al. | 2406.19333 | null |
2024-06-27 | Subtractive Training for Music Stem Insertion using Latent Diffusion Models | Ivan Villa-Renteria et.al. | 2406.19328 | null |
2024-06-27 | Vector Resonant Relaxation and Statistical Closure Theory. I. Direct Interaction Approximation | Sofia Flores et.al. | 2406.19306 | null |
2024-06-27 | Compositional Image Decomposition with Diffusion Models | Jocelin Su et.al. | 2406.19298 | null |
2024-06-27 | Advection Augmented Convolutional Neural Networks | Niloufar Zakariaei et.al. | 2406.19253 | link |
2024-06-27 | Diffuse interstellar bands in the near-infrared: Expanding the reddening range | R. Castellanos et.al. | 2406.19229 | null |
2024-06-27 | Numerical Analysis of the Complete Active-Space Extended Koopmans's Theorem | Reza Hemmati et.al. | 2406.19211 | null |
2024-06-27 | The case for Centaurus A as the main source of ultrahigh-energy cosmic rays | Silvia Mollerach et.al. | 2406.19199 | null |
2024-06-26 | MultiDiff: Consistent Novel View Synthesis from a Single Image | Norman Müller et.al. | 2406.18524 | null |
2024-06-26 | Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration | Kang Liao et.al. | 2406.18516 | link |
2024-06-26 | Ground states of a nonlocal variational problem and Thomas-Fermi limit for the Choquard equation | Damiano Greco et.al. | 2406.18472 | null |
2024-06-26 | DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance | Younghyun Kim et.al. | 2406.18459 | link |
2024-06-26 | How to Achieve High Spatial Resolution in Organic Optobioelectronic Devices? | Luca Fabbri et.al. | 2406.18447 | null |
2024-06-26 | Towards diffusion models for large-scale sea-ice modelling | Tobias Sebastian Finn et.al. | 2406.18417 | link |
2024-06-26 | From Majority to Minority: A Diffusion-based Augmentation for Underrepresented Groups in Skin Lesion Analysis | Janet Wang et.al. | 2406.18375 | link |
2024-06-27 | Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process | Tianyu Lin et.al. | 2406.18361 | link |
2024-06-26 | Convergence to equilibrium for a degenerate three species reaction-diffusion system | Saumyajit Das et.al. | 2406.18339 | null |
2024-06-26 | Molecular Diffusion Models with Virtual Receptors | Matan Halfon et.al. | 2406.18330 | null |
2024-06-25 | DiffusionPDE: Generative PDE-Solving Under Partial Observation | Jiahe Huang et.al. | 2406.17763 | link |
2024-06-25 | Fairness in Social Influence Maximization via Optimal Transport | Shubham Chowdhary et.al. | 2406.17736 | link |
2024-06-25 | Extreme Diffusion Measures Statistical Fluctuations of the Environment | Jacob Hass et.al. | 2406.17733 | null |
2024-06-25 | On Explicit Solutions for Coupled Reaction-Diffusion and Burgers-Type Equations with Variable Coefficients Through a Riccati System | José M. Escorcia et.al. | 2406.17690 | null |
2024-06-25 | Unified Auto-Encoding with Masked Diffusion | Philippe Hansen-Estruch et.al. | 2406.17688 | link |
2024-06-25 | Asymptotic Properties of Random Homology Induced by Diffusion Processes | Artem Galkin et.al. | 2406.17683 | null |
2024-06-25 | LaTable: Towards Large Tabular Models | Boris van Breugel et.al. | 2406.17673 | null |
2024-06-26 | SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond | Marco Comunità et.al. | 2406.17672 | null |
2024-06-25 | Aligning Diffusion Models with Noise-Conditioned Perception | Alexander Gambashidze et.al. | 2406.17636 | null |
2024-06-25 | Test-Time Generative Augmentation for Medical Image Segmentation | Xiao Ma et.al. | 2406.17608 | link |
2024-06-24 | StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal | Chongjie Ye et.al. | 2406.16864 | null |
2024-06-24 | FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models | Haonan Qiu et.al. | 2406.16863 | link |
2024-06-24 | Dreamitate: Real-World Visuomotor Policy Learning via Video Generation | Junbang Liang et.al. | 2406.16862 | null |
2024-06-24 | General Binding Affinity Guidance for Diffusion Models in Structure-Based Drug Design | Yue Jian et.al. | 2406.16821 | null |
2024-06-24 | Damping effects of viscous dissipation on growth of symmetric instability | Laur Ferris et.al. | 2406.16818 | null |
2024-06-24 | ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians | Yufei Liu et.al. | 2406.16815 | null |
2024-06-24 | 3D distortion-free, reduced field of view diffusion-prepared GRE at 3T | Sarah McElroy et.al. | 2406.16809 | null |
2024-06-24 | Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image | Jinkun Hao et.al. | 2406.16710 | null |
2024-06-24 | Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling | Min-Seop Kwak et.al. | 2406.16695 | null |
2024-06-24 | Repulsive Score Distillation for Diverse Sampling of Diffusion Models | Nicolas Zilberstein et.al. | 2406.16683 | link |
2024-06-21 | Network-Based Optimal Control of Pollution Growth | Fausto Gozzi et.al. | 2406.15338 | null |
2024-06-21 | Masked Extended Attention for Zero-Shot Virtual Try-On In The Wild | Nadav Orzech et.al. | 2406.15331 | null |
2024-06-21 | Global existence of solutions to a nonlocal equation with degenerate anisotropic diffusion | Maria Eckardt et.al. | 2406.15318 | null |
2024-06-21 | Interacting phase fields yielding phase separation on surfaces | Benjamin Lledos et.al. | 2406.15300 | null |
2024-06-21 | Symmetry-controlled SrRuO3/SrTiO3/SrRuO3 magnetic tunnel junctions:Spin polarization and its relevance to tunneling magnetoresistance | Kartik Samanta et.al. | 2406.15290 | null |
2024-06-21 | The random walk of intermittently self-propelled particles | Agniva Datta et.al. | 2406.15277 | null |
2024-06-21 | You Only Acquire Sparse-channel (YOAS): A Unified Framework for Dense-channel EEG Generation | Hongyu Chen et.al. | 2406.15269 | null |
2024-06-21 | Can cosmic rays explain the high ionisation rates in the Galactic Centre? | Sruthiranjani Ravikularaman et.al. | 2406.15260 | null |
2024-06-21 | Drag reduction in surfactant-contaminated superhydrophobic channels at high Péclet numbers | Samuel D. Tomlinson et.al. | 2406.15251 | null |
2024-06-21 | Unconstrained dynamic gel swelling generates transient surface deformations | Alyssa VanZanten et.al. | 2406.15224 | null |
2024-06-20 | A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models | Xincheng Shuai et.al. | 2406.14555 | link |
2024-06-21 | Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation | Eyal Michaeli et.al. | 2406.14551 | link |
2024-06-20 | Consistency Models Made Easy | Zhengyang Geng et.al. | 2406.14548 | link |
2024-06-20 | Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps | Nikita Starodubcev et.al. | 2406.14539 | null |
2024-06-20 | Formulation of Chimera Gradient Flows for Chemotaxis Systems with Indirect Signal Production and Degenerate Diffusion | Yoshifumi Mimura et.al. | 2406.14536 | null |
2024-06-20 | ForSE+: Simulating non-Gaussian CMB foregrounds at 3 arcminutes in a stochastic way based on a generative adversarial network | Jian Yao et.al. | 2406.14519 | link |
2024-06-20 | V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data | Rotem Shalev-Arkushin et.al. | 2406.14510 | null |
2024-06-20 | SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset | Josef Dai et.al. | 2406.14477 | link |
2024-06-20 | Video Generation with Learned Action Prior | Meenakshi Sarkar et.al. | 2406.14436 | null |
2024-06-20 | CollaFuse: Collaborative Diffusion Models | Simeon Allmendinger et.al. | 2406.14429 | link |
2024-06-18 | A Characterization of Semi-Involutory MDS Matrices | Tapas Chatterjee et.al. | 2406.12842 | null |
2024-06-18 | Evaluating the design space of diffusion-based generative models | Yuqing Wang et.al. | 2406.12839 | null |
2024-06-18 | Influence Maximization via Graph Neural Bandits | Yuting Feng et.al. | 2406.12835 | link |
2024-06-18 | Neural Approximate Mirror Maps for Constrained Diffusion Models | Berthy T. Feng et.al. | 2406.12816 | null |
2024-06-18 | The Mathematics of Dots and Pixels: On the Theoretical Foundations of Image Halftoning | Felix Krahmer et.al. | 2406.12760 | null |
2024-06-18 | Extracting Training Data from Unconditional Diffusion Models | Yunhao Chen et.al. | 2406.12752 | null |
2024-06-18 | Using the Haken-Strobl-Reineker Model to Determine the Temperature Dependence of the Diffusion Coefficient | William Barford et.al. | 2406.12750 | null |
2024-06-18 | Liouville results for semilinear integral equations with conical diffusion | Isabeau Birindelli et.al. | 2406.12720 | null |
2024-06-18 | Concurrent Accretion and Migration of Giant Planets in their Natal Disks with Consistent Accretion Torque | Ya-Ping Li et.al. | 2406.12716 | null |
2024-06-18 | Speak in the Scene: Diffusion-based Acoustic Scene Transfer toward Immersive Speech Generation | Miseul Kim et.al. | 2406.12688 | null |
2024-06-17 | Autoregressive Image Generation without Vector Quantization | Tianhong Li et.al. | 2406.11838 | link |
2024-06-17 | Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models | Bingqi Ma et.al. | 2406.11831 | null |
2024-06-17 | MegaScenes: Scene-Level View Synthesis at Scale | Joseph Tung et.al. | 2406.11819 | link |
2024-06-17 | DiffMM: Multi-Modal Diffusion Model for Recommendation | Yangqin Jiang et.al. | 2406.11781 | link |
2024-06-17 | Simulation of bright and dark diffuse multiple scattering lines in high-flux synchrotron X-ray experiments | M. B. Estradiote et.al. | 2406.11764 | null |
2024-06-17 | Site-percolation transition of run-and-tumble particles | Soumya K. Saha et.al. | 2406.11726 | null |
2024-06-17 | Latent Denoising Diffusion GAN: Faster sampling, Higher image quality | Luan Thanh Trinh et.al. | 2406.11713 | link |
2024-06-17 | Tackling the Curse of Dimensionality in Fractional and Tempered Fractional PDEs with Physics-Informed Neural Networks | Zheyuan Hu et.al. | 2406.11708 | null |
2024-06-17 | Diffusion Generative Modelling for Divide-and-Conquer MCMC | C. Trojan et.al. | 2406.11664 | link |
2024-06-17 | AnyMaker: Zero-shot General Object Customization via Decoupled Dual-Level ID Injection | Lingjie Kong et.al. | 2406.11643 | link |
2024-06-14 | VideoGUI: A Benchmark for GUI Automation from Instructional Videos | Kevin Qinghong Lin et.al. | 2406.10227 | null |
2024-06-14 | SatDiffMoE: A Mixture of Estimation Method for Satellite Image Super-resolution with Latent Diffusion Models | Zhaoxu Luo et.al. | 2406.10225 | null |
2024-06-14 | Diffusion Synthesizer for Efficient Multilingual Speech to Speech Translation | Nameer Hirschkind et.al. | 2406.10223 | null |
2024-06-14 | DiffusionBlend: Learning 3D Image Prior through Position-aware Diffusion Score Blending for 3D Computed Tomography Reconstruction | Bowen Song et.al. | 2406.10211 | null |
2024-06-14 | Make It Count: Text-to-Image Generation with an Accurate Number of Objects | Lital Binyamin et.al. | 2406.10210 | null |
2024-06-14 | Crafting Parts for Expressive Object Composition | Harsh Rangwani et.al. | 2406.10197 | null |
2024-06-14 | Training-free Camera Control for Video Generation | Chen Hou et.al. | 2406.10126 | null |
2024-06-14 | GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors | Xiqian Yu et.al. | 2406.10111 | null |
2024-06-14 | Convergence to equilibrium for cross diffusion systems with nonlocal interaction | Daniel Matthes et.al. | 2406.10075 | null |
2024-06-14 | Partial stochastic resetting with refractory periods | Kristian Stølevik Olsen et.al. | 2406.10039 | null |
2024-06-13 | Rethinking Score Distillation as a Bridge Between Image Distributions | David McAllister et.al. | 2406.09417 | null |
2024-06-13 | Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models | Qihao Liu et.al. | 2406.09416 | null |
2024-06-13 | An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels | Duy-Kien Nguyen et.al. | 2406.09415 | null |
2024-06-13 | Depth Anything V2 | Lihe Yang et.al. | 2406.09414 | link |
2024-06-13 | Interpreting the Weight Space of Customized Diffusion Models | Amil Dravid et.al. | 2406.09413 | link |
2024-06-13 | ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing | Jun-Kun Chen et.al. | 2406.09404 | null |
2024-06-13 | Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion | Linzhan Mou et.al. | 2406.09402 | null |
2024-06-13 | OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation | Junke Wang et.al. | 2406.09399 | link |
2024-06-13 | WonderWorld: Interactive 3D Scene Generation from a Single Image | Hong-Xing Yu et.al. | 2406.09394 | null |
2024-06-13 | Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior | Baiang Li et.al. | 2406.09389 | link |
2024-06-12 | Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation | Raphael Tang et.al. | 2406.08482 | null |
2024-06-12 | What If We Recaption Billions of Web Images with LLaMA-3? | Xianhang Li et.al. | 2406.08478 | null |
2024-06-12 | Human 3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models | Yuxuan Xue et.al. | 2406.08475 | null |
2024-06-12 | Pranath Reddy et.al. | 2406.08442 | null | |
2024-06-12 | Effect of Cr Segregation on Grain Growth in Nanocrystalline α-Fe Alloy: A Multiscale Modelling Approach | Sandip Guin et.al. | 2406.08437 | null |
2024-06-12 | Diffusion Soup: Model Merging for Text-to-Image Diffusion Models | Benjamin Biggs et.al. | 2406.08431 | null |
2024-06-12 | FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation | Xinzhi Mu et.al. | 2406.08392 | null |
2024-06-12 | Resetting by rescaling: exact results for a diffusing particle in one-dimension | Marco Biroli et.al. | 2406.08387 | null |
2024-06-12 | Diff-A-Riff: Musical Accompaniment Co-creation via Latent Diffusion Models | Javier Nistal et.al. | 2406.08384 | null |
2024-06-12 | 2.5D Multi-view Averaging Diffusion Model for 3D Medical Image Translation: Application to Low-count PET Reconstruction with CT-less Attenuation Correction | Tianqi Chen et.al. | 2406.08374 | null |
2024-06-11 | An Image is Worth 32 Tokens for Reconstruction and Generation | Qihang Yu et.al. | 2406.07550 | link |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548 | link |
2024-06-11 | Zero-shot Image Editing with Reference Imitation | Xi Chen et.al. | 2406.07547 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540 | null |
2024-06-11 | Simple and Effective Masked Diffusion Language Models | Subham Sekhar Sahoo et.al. | 2406.07524 | link |
2024-06-11 | Neural Gaffer: Relighting Any Object via Diffusion | Haian Jin et.al. | 2406.07520 | null |
2024-06-11 | Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros et.al. | 2406.07516 | null |
2024-06-11 | Flow Map Matching | Nicholas M. Boffi et.al. | 2406.07507 | null |
2024-06-11 | Understanding Visual Concepts Across Models | Brandon Trabucco et.al. | 2406.07506 | link |
2024-06-10 | IllumiNeRF: 3D Relighting without Inverse Rendering | Xiaoming Zhao et.al. | 2406.06527 | null |
2024-06-10 | Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation | Peize Sun et.al. | 2406.06525 | link |
2024-06-10 | NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing | Ting-Hsuan Chen et.al. | 2406.06523 | link |
2024-06-10 | Monkey See, Monkey Do: Harnessing Self-attention in Motion Diffusion for Zero-shot Motion Transfer | Sigal Raab et.al. | 2406.06508 | link |
2024-06-10 | Rephasing spectral diffusion in time-bin spin-spin entanglement protocols | Mehmet T. Uysal et.al. | 2406.06497 | null |
2024-06-10 | Probing the Heights and Depths of Y Dwarf Atmospheres: A Retrieval Analysis of the JWST Spectral Energy Distribution of WISE J035934.06 |
Harshil Kothari et.al. | 2406.06493 | null |
2024-06-10 | AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction | Zhen Xing et.al. | 2406.06465 | null |
2024-06-10 | Cometh: A continuous-time discrete-state graph diffusion model | Antoine Siraudin et.al. | 2406.06449 | null |
2024-06-10 | QSSEP describes the fluctuations of quantum coherences in the Anderson model | Ludwig Hruza et.al. | 2406.06444 | null |
2024-06-10 | Margin-aware Preference Optimization for Aligning Diffusion Models without Reference | Jiwoo Hong et.al. | 2406.06424 | null |
2024-06-07 | DVOS: Self-Supervised Dense-Pattern Video Object Segmentation | Keyhan Najafian et.al. | 2406.05131 | null |
2024-06-07 | Ohms law lost and regained: observation and impact of zeros and poles | Krishna Joshi et.al. | 2406.05112 | null |
2024-06-07 | Large Generative Graph Models | Yu Wang et.al. | 2406.05109 | null |
2024-06-07 | CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion | Xingrui Wang et.al. | 2406.05082 | null |
2024-06-07 | GenHeld: Generating and Editing Handheld Objects | Chaerin Min et.al. | 2406.05059 | link |
2024-06-07 | Digital Twins of the EM Environment: Benchmark for Ray Launching Models | Michele Zhu et.al. | 2406.05042 | link |
2024-06-07 | Efficient 3D Shape Generation via Diffusion Mamba with Bidirectional SSMs | Shentong Mo et.al. | 2406.05038 | null |
2024-06-07 | Linear stability analysis for a system of singular amplitude equations arising in biomorphology | Aric Wheeler et.al. | 2406.05037 | null |
2024-06-07 | Generative diffusion models for synthetic trajectories of heavy and light particles in turbulence | Tianyi Li et.al. | 2406.05008 | null |
2024-06-07 | CityCraft: A Real Crafter for 3D City Generation | Jie Deng et.al. | 2406.04983 | null |
2024-06-06 | GLACE: Global Local Accelerated Coordinate Encoding | Fangjinhua Wang et.al. | 2406.04340 | link |
2024-06-07 | Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion | Fangfu Liu et.al. | 2406.04338 | null |
2024-06-06 | Coherent Zero-Shot Visual Instruction Generation | Quynh Phung et.al. | 2406.04337 | null |
2024-06-06 | BitsFusion: 1.99 bits Weight Quantization of Diffusion Model | Yang Sui et.al. | 2406.04333 | link |
2024-06-06 | Simplified and Generalized Masked Diffusion for Discrete Data | Jiaxin Shi et.al. | 2406.04329 | link |
2024-06-06 | SF-V: Single Forward Video Generation Model | Zhixing Zhang et.al. | 2406.04324 | link |
2024-06-06 | ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories | Qianlan Yang et.al. | 2406.04323 | null |
2024-06-07 | DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data | Qihao Liu et.al. | 2406.04322 | link |
2024-06-06 | Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step | Zhanhao Liang et.al. | 2406.04314 | link |
2024-06-06 | ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization | Luca Eyring et.al. | 2406.04312 | link |
2024-06-05 | Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input | Joachim Ott et.al. | 2406.03439 | null |
2024-06-05 | Non-stationary Spatio-Temporal Modeling Using the Stochastic Advection-Diffusion Equation | Martin Outzen Berild et.al. | 2406.03400 | link |
2024-06-05 | Reparameterization invariance in approximate Bayesian inference | Hrittik Roy et.al. | 2406.03334 | null |
2024-06-05 | UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning | Yu Zhang et.al. | 2406.03324 | null |
2024-06-05 | Text-to-Image Rectified Flow as Plug-and-Play Priors | Xiaofeng Yang et.al. | 2406.03293 | link |
2024-06-05 | Relative Entropy for the Numerical Diffusive Limit of the Linear Jin-Xin System | Marianne Bessemoulin-Chatard et.al. | 2406.03268 | null |
2024-06-05 | Generative Diffusion Models for Fast Simulations of Particle Collisions at CERN | Mikołaj Kita et.al. | 2406.03233 | null |
2024-06-05 | Holographic drag force with translational symmetry breaking | Sara Tahery et.al. | 2406.03220 | null |
2024-06-05 | Searching Priors Makes Text-to-Video Synthesis Better | Haoran Cheng et.al. | 2406.03215 | null |
2024-06-05 | Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion | Hao Wen et.al. | 2406.03184 | link |
2024-06-04 | Dreamguider: Improved Training free Diffusion-based Conditional Generation | Nithin Gopalakrishnan Nair et.al. | 2406.02549 | null |
2024-06-05 | Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting | Inkyu Shin et.al. | 2406.02541 | null |
2024-06-04 | ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation | Tianchen Zhao et.al. | 2406.02540 | link |
2024-06-04 | CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation | Dejia Xu et.al. | 2406.02509 | null |
2024-06-04 | Guiding a Diffusion Model with a Bad Version of Itself | Tero Karras et.al. | 2406.02507 | link |
2024-06-04 | Tensor Network Space-Time Spectral Collocation Method for Solving the Nonlinear Convection Diffusion Equation | Dibyendu Adak et.al. | 2406.02505 | null |
2024-06-04 | Singular Subspace Perturbation Bounds via Rectangular Random Matrix Diffusions | Peiyao Lai et.al. | 2406.02502 | null |
2024-06-04 | Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation | Jiajun Wang et.al. | 2406.02485 | link |
2024-06-04 | Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion | Colin Hansen et.al. | 2406.02477 | null |
2024-06-04 | Learning Image Priors through Patch-based Diffusion Models for Solving Inverse Problems | Jason Hu et.al. | 2406.02462 | link |
2024-05-31 | Mixed Diffusion for 3D Indoor Scene Synthesis | Siyi Hu et.al. | 2405.21066 | link |
2024-05-31 | Unified Directly Denoising for Both Variance Preserving and Variance Exploding Diffusion Models | Jingjing Wang et.al. | 2405.21059 | null |
2024-05-31 | Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models | Xinxi Zhang et.al. | 2405.21050 | null |
2024-05-31 | Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling | Jiatao Gu et.al. | 2405.21048 | null |
2024-05-31 | Beyond Conventional Parametric Modeling: Data-Driven Framework for Estimation and Prediction of Time Activity Curves in Dynamic PET Imaging | Niloufar Zakariaei et.al. | 2405.21021 | null |
2024-05-31 | Amortizing intractable inference in diffusion models for vision, language, and control | Siddarth Venkatraman et.al. | 2405.20971 | link |
2024-06-03 | Large Language Models are Zero-Shot Next Location Predictors | Ciro Beneduce et.al. | 2405.20962 | link |
2024-05-31 | Search of extended emission from HESS J1702-420 with eROSITA | Denys Malyshev et.al. | 2405.20927 | null |
2024-05-31 | Flow matching achieves minimax optimal convergence | Kenji Fukumizu et.al. | 2405.20879 | null |
2024-05-31 | MegActor: Harness the Power of Raw Video for Vivid Portrait Animation | Shurong Yang et.al. | 2405.20851 | link |
2024-05-30 | Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image | Kailu Wu et.al. | 2405.20343 | link |
2024-05-30 | OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving | Lening Wang et.al. | 2405.20337 | link |
2024-05-30 | VividDream: Generating 3D Scene with Ambient Dynamics | Yao-Chih Lee et.al. | 2405.20334 | null |
2024-05-30 | MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion | Shuyuan Tu et.al. | 2405.20325 | link |
2024-05-30 | Don't drop your samples! Coherence-aware training benefits Conditional diffusion | Nicolas Dufour et.al. | 2405.20324 | null |
2024-05-30 | Improving the Training of Rectified Flows | Sangyun Lee et.al. | 2405.20320 | link |
2024-05-30 | DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation | Zachary Novack et.al. | 2405.20289 | null |
2024-05-30 | SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow | Chaoyang Wang et.al. | 2405.20282 | link |
2024-05-30 | CV-VAE: A Compatible Video VAE for Latent Generative Video Models | Sijie Zhao et.al. | 2405.20279 | link |
2024-05-31 | KerasCV and KerasNLP: Vision and Language Power-Ups | Matthew Watson et.al. | 2405.20247 | null |
2024-05-29 | X-VILA: Cross-Modality Alignment for Large Language Model | Hanrong Ye et.al. | 2405.19335 | null |
2024-05-29 | Hilbert Space Diffusion in Systems with Approximate Symmetries | Rahel L. Baumgartner et.al. | 2405.19260 | null |
2024-05-29 | Weak Generative Sampler to Efficiently Sample Invariant Distribution of Stochastic Differential Equation | Zhiqiang Cai et.al. | 2405.19256 | null |
2024-05-29 | ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning | Ruchika Chavhan et.al. | 2405.19237 | link |
2024-05-29 | Pseudo-Gevrey Smoothing for the Passive Scalar Equations near Couette | Jacob Bedrossian et.al. | 2405.19233 | null |
2024-05-29 | DiPPeST: Diffusion-based Path Planner for Synthesizing Trajectories Applied on Quadruped Robots | Maria Stamatopoulou et.al. | 2405.19232 | null |
2024-05-29 | Contrastive-Adversarial and Diffusion: Exploring pre-training and fine-tuning strategies for sulcal identification | Michail Mamalakis et.al. | 2405.19204 | null |
2024-05-30 | Weitian Zhang et.al. | 2405.19203 | null | |
2024-05-29 | Going beyond compositional generalization, DDPMs can produce zero-shot interpolation | Justin Deschenaux et.al. | 2405.19201 | link |
2024-05-29 | Diffusion-based Dynamics Models for Long-Horizon Rollout in Offline Reinforcement Learning | Hanye Zhao et.al. | 2405.19189 | link |
2024-05-28 | On the Origin of Llamas: Model Tree Heritage Recovery | Eliahu Horwitz et.al. | 2405.18432 | link |
2024-05-28 | DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention | Lianghui Zhu et.al. | 2405.18428 | link |
2024-05-28 | Phased Consistency Model | Fu-Yun Wang et.al. | 2405.18407 | link |
2024-05-28 | RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives | Jaehong Yoon et.al. | 2405.18406 | link |
2024-05-28 | Short-time Fokker-Planck propagator beyond the Gaussian approximation | Julian Kappler et.al. | 2405.18381 | link |
2024-05-28 | A Hessian-Aware Stochastic Differential Equation for Modelling SGD | Xiang Li et.al. | 2405.18373 | null |
2024-05-28 | Simulating infinite-dimensional nonlinear diffusion bridges | Gefan Yang et.al. | 2405.18353 | link |
2024-05-28 | VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers | Jun Zheng et.al. | 2405.18326 | null |
2024-05-28 | Multi-modal Generation via Cross-Modal In-Context Learning | Amandeep Kumar et.al. | 2405.18304 | link |
2024-05-28 | CT-based brain ventricle segmentation via diffusion Schrödinger Bridge without target domain ground truths | Reihaneh Teimouri et.al. | 2405.18267 | link |
2024-05-27 | Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control | Zhengfei Kuang et.al. | 2405.17414 | null |
2024-05-27 | Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer | Ruizhi Shao et.al. | 2405.17405 | null |
2024-05-27 | A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training | Kai Wang et.al. | 2405.17403 | link |
2024-05-27 | RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control | Litu Rout et.al. | 2405.17401 | null |
2024-05-27 | EASI-Tex: Edge-Aware Mesh Texturing from Single Image | Sai Raj Kishore Perla et.al. | 2405.17393 | null |
2024-05-27 | Global existence, fast signal diffusion limit, and |
Cordula Reisch et.al. | 2405.17392 | null |
2024-05-27 | Supernova Remnants in Gamma Rays | Andrea Giuliani et.al. | 2405.17384 | null |
2024-05-27 | Muon spin relaxation in mixed perovskite (LaAlO $3$)${x}$(SrAl${0.5}$Ta${0.5}$O$3$)${1-x}$ with |
Takashi U. Ito et.al. | 2405.17371 | null |
2024-05-27 | Finite Fractal Dimension of uniform attractors for non-autonomous dynamical systems with infinite dimensional symbol space | Rafael de Oliveira Moura et.al. | 2405.17367 | null |
2024-05-27 | Emergent time crystal from a fractional Langevin equation with white and colored noise | David Santiago Quevedo et.al. | 2405.17331 | null |
2024-05-24 | Self-consistent evaluation of proximity and inverse proximity effects with pair-breaking in diffusive SN junctions | Arpit Raj et.al. | 2405.15770 | null |
2024-05-24 | FastDrag: Manipulate Anything in One Step | Xuanjia Zhao et.al. | 2405.15769 | null |
2024-05-24 | InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation | Yuchi Wang et.al. | 2405.15758 | link |
2024-05-24 | Looking Backward: Streaming Video-to-Video Translation with Feature Banks | Feng Liang et.al. | 2405.15757 | link |
2024-05-24 | Score-based generative models are provably robust: an uncertainty quantification perspective | Nikiforos Mimikos-Stamatopoulos et.al. | 2405.15754 | null |
2024-05-24 | Murray-von Neumann dimension for strictly semifinite weights | Aldo Garcia Guinto et.al. | 2405.15725 | null |
2024-05-24 | Hierarchical Uncertainty Exploration via Feedforward Posterior Trees | Elias Nehme et.al. | 2405.15719 | null |
2024-05-24 | Simulation-based inference of radio millisecond pulsars in globular clusters | Joanna Berteaud et.al. | 2405.15691 | null |
2024-05-24 | Jet Quenching of the Heavy Quarks in the Quark-Gluon Plasma and the Nonadditive Statistics | Trambak Bhattacharyya et.al. | 2405.15679 | null |
2024-05-24 | Taming Score-Based Diffusion Priors for Infinite-Dimensional Nonlinear Inverse Problems | Lorenzo Baldassari et.al. | 2405.15676 | null |
2024-05-23 | Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis | Basile Van Hoorick et.al. | 2405.14868 | null |
2024-05-23 | Improved Distribution Matching Distillation for Fast Image Synthesis | Tianwei Yin et.al. | 2405.14867 | link |
2024-05-23 | Video Diffusion Models are Training-free Motion Interpreter and Controller | Zeqi Xiao et.al. | 2405.14864 | null |
2024-05-23 | Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models | Gen Li et.al. | 2405.14861 | null |
2024-05-23 | Semantica: An Adaptable Image-Conditioned Diffusion Model | Manoj Kumar et.al. | 2405.14857 | null |
2024-05-23 | TerDiT: Ternary Diffusion Models with Transformers | Xudong Lu et.al. | 2405.14854 | link |
2024-05-23 | Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer | Shuang Wu et.al. | 2405.14832 | null |
2024-05-23 | Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models | Katherine Xu et.al. | 2405.14828 | null |
2024-05-23 | New limits on neutrino decay from high-energy astrophysical neutrinos | Victor B. Valera et.al. | 2405.14826 | null |
2024-05-23 | PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher | Dongjun Kim et.al. | 2405.14822 | link |
2024-05-21 | Personalized Residuals for Concept-Driven Text-to-Image Generation | Cusuh Ham et.al. | 2405.12978 | null |
2024-05-21 | Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control | Yue Han et.al. | 2405.12970 | null |
2024-05-21 | Differential Walk on Spheres | Bailey Miller et.al. | 2405.12964 | null |
2024-05-21 | Learning the Infinitesimal Generator of Stochastic Diffusion Processes | Vladimir R. Kostic et.al. | 2405.12940 | null |
2024-05-21 | Impact of inhomogeneous diffusion on secondary cosmic ray and antiproton local spectra | Álvaro Tovar-Pardo et.al. | 2405.12918 | null |
2024-05-21 | An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation | Zhiyu Tan et.al. | 2405.12914 | link |
2024-05-21 | Deep HST/UVIS imaging of the candidate dark galaxy CDG-1 | Pieter van Dokkum et.al. | 2405.12907 | null |
2024-05-21 | Diffusion of brightened dark excitons in a high-angle incommensurate Moiré homobilayer | Arnab Barman Ray et.al. | 2405.12901 | null |
2024-05-21 | Diffusion-RSCC: Diffusion Probabilistic Model for Change Captioning in Remote Sensing Images | Xiaofei Yu et.al. | 2405.12875 | link |
2024-05-21 | High-Field Microscale NMR Spectroscopy with NV Centers in Dipolarly-Coupled Samples | Carlos Munuera-Javaloy et.al. | 2405.12857 | null |
2024-05-20 | Images that Sound: Composing Images and Sounds on a Single Canvas | Ziyang Chen et.al. | 2405.12221 | null |
2024-05-20 | Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices | Nathaniel Cohen et.al. | 2405.12211 | link |
2024-05-20 | Cosmic Ray Diffusion in the Turbulent Interstellar Medium: Effects of Mirror Diffusion and Pitch Angle Scattering | Lucas Barreto-Mota et.al. | 2405.12146 | null |
2024-05-20 | Two-dimensional signal-dependent parabolic-elliptic Keller-Segel system and its means field derivation | Lukas Bol et.al. | 2405.12134 | null |
2024-05-20 | An effective advection induced by oscillating microstructure in a diffusion equation | David Wiedemann et.al. | 2405.12108 | null |
2024-05-20 | Sobolev regularity theory for stochastic reaction-diffusion-advection equations with spatially homogeneous colored noises and variable-order nonlocal operators | Jae-Hwan Choi et.al. | 2405.11969 | null |
2024-05-20 | Optimal balanced-norm error estimate of the LDG method for reaction-diffusion problems II: the two-dimensional case with layer-upwind flux | Yao Cheng et.al. | 2405.11939 | null |
2024-05-20 | Nonequilbrium physics of generative diffusion models | Zhendong Yu et.al. | 2405.11932 | null |
2024-05-20 | "Set It Up!": Functional Object Arrangement with Compositional Generative Models | Yiqing Xu et.al. | 2405.11928 | null |
2024-05-20 | Diff-BGM: A Diffusion Model for Video Background Music Generation | Sizhe Li et.al. | 2405.11913 | link |
2024-05-17 | Probabilistic transfer learning methodology to expedite high fidelity simulation of reactive flows | Bruno S. Soriano et.al. | 2405.10944 | null |
2024-05-17 | Reconstruction of Manipulated Garment with Guided Deformation Prior | Ren Li et.al. | 2405.10934 | null |
2024-05-17 | Limitations of the rate-distribution formalism in describing luminescence quenching in the presence of diffusion | Jakub Jędrak et.al. | 2405.10903 | null |
2024-05-17 | Improving face generation quality and prompt following with synthetic captions | Michail Tarasiou et.al. | 2405.10864 | null |
2024-05-17 | Diffusion Geometry | Iolo Jones et.al. | 2405.10858 | link |
2024-05-17 | Some remarks on a mathematical model for water flow in porous media with competition between transport and diffusion | Judita Runcziková et.al. | 2405.10751 | null |
2024-05-17 | Deep Data Consistency: a Fast and Robust Diffusion Model-based Solver for Inverse Problems | Hanyu Chen et.al. | 2405.10748 | link |
2024-05-17 | Eddeep: Fast eddy-current distortion correction for diffusion MRI with deep learning | Antoine Legouhy et.al. | 2405.10723 | link |
2024-05-17 | Numerical Recovery of the Diffusion Coefficient in Diffusion Equations from Terminal Measurement | Bangti Jin et.al. | 2405.10708 | null |
2024-05-17 | Ratchet-mediated resetting: Current, efficiency, and exact solution | Connor Roberts et.al. | 2405.10698 | null |
2024-05-16 | Text-to-Vector Generation with Neural Path Representation | Peiying Zhang et.al. | 2405.10317 | null |
2024-05-16 | Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model | Zheng Gu et.al. | 2405.10316 | null |
2024-05-16 | CAT3D: Create Anything in 3D with Multi-View Diffusion Models | Ruiqi Gao et.al. | 2405.10314 | null |
2024-05-16 | Societal Adaptation to Advanced AI | Jamie Bernardi et.al. | 2405.10295 | null |
2024-05-16 | Power-law relaxation of a confined diffusing particle subject to resetting with memory | Denis Boyer et.al. | 2405.10283 | null |
2024-05-16 | Interplay between Domain Walls in Type-II Superconductors and Gradients of Temperature/Spin Density | Takuma Kanakubo et.al. | 2405.10200 | null |
2024-05-16 | Fixed points of maps and nontrivial weak solutions to a class of nonlinear strongly coupled elliptic systems | Dung Le et.al. | 2405.10171 | null |
2024-05-16 | Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks | João Bordalo et.al. | 2405.10122 | null |
2024-05-16 | Advancing Set-Conditional Set Generation: Graph Diffusion for Fast Simulation of Reconstructed Particles | Dmitrii Kobylianskii et.al. | 2405.10106 | link |
2024-05-16 | Spurious reconstruction from brain activity | Ken Shirakawa et.al. | 2405.10078 | link |
2024-05-15 | MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer | Chengyu Wu et.al. | 2405.09539 | link |
2024-05-15 | A velocity-based moving mesh Discontinuous Galerkin method for the advection-diffusion equation | Ezra Rozier et.al. | 2405.09408 | null |
2024-05-15 | Probing particle acceleration in Abell 2256: from to 16 MHz to gamma rays | E. Osinga et.al. | 2405.09384 | null |
2024-05-15 | Diffusion-based Contrastive Learning for Sequential Recommendation | Ziqiang Cui et.al. | 2405.09369 | link |
2024-05-15 | DeCoDEx: Confounder Detector Guidance for Improved Diffusion-based Counterfactual Explanations | Nima Fathi et.al. | 2405.09288 | link |
2024-05-15 | Searches for Galactic Neutrinos with the IceCube Neutrino observatory | A. Sandrock et.al. | 2405.09267 | null |
2024-05-15 | Dance Any Beat: Blending Beats with Visuals in Dance Video Generation | Xuanchen Wang et.al. | 2405.09266 | null |
2024-05-15 | Exact analysis of the two-dimensional asymmetric simple exclusion process with attachment and detachment of particles | Yuki Ishiguro et.al. | 2405.09261 | null |
2024-05-15 | Propagation of chaos for moderately interacting particle systems related to singular kinetic Mckean-Vlasov SDEs | Zimo Hao et.al. | 2405.09195 | null |
2024-05-15 | QMedShield: A Novel Quantum Chaos-based Image Encryption Scheme for Secure Medical Image Storage in the Cloud | Arun Amaithi Rajan et.al. | 2405.09191 | null |
2024-05-14 | The Flux Hypothesis for Odd Transport Phenomena | Cory Hargus et.al. | 2405.08798 | null |
2024-05-14 | A Generalized Curvilinear Coordinate system-based Patch Dynamics Scheme in Equation-free Multiscale Modelling | Tanay Kumar Karmakar et.al. | 2405.08764 | null |
2024-05-14 | Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding | Zhimin Li et.al. | 2405.08748 | link |
2024-05-14 | Dimensionality reduction in bulk-boundary reaction-diffusion systems | Tom Burkart et.al. | 2405.08728 | null |
2024-05-14 | Design and Analysis of Resilient Vehicular Platoon Systems over Wireless Networks | Tingyu Shui et.al. | 2405.08706 | null |
2024-05-14 | Expensive Multi-Objective Bayesian Optimization Based on Diffusion Models | Bingdong Li et.al. | 2405.08674 | null |
2024-05-14 | Quantum Circuit Model for Lattice Boltzmann Fluid Flow Simulations | Dinesh Kumar E et.al. | 2405.08669 | null |
2024-05-14 | Anomalous Landau damping and algebraic thermalization in two-dimensional superfluids far from equilibrium | Clément Duval et.al. | 2405.08606 | null |
2024-05-14 | PTPI-DL-ROMs: pre-trained physics-informed deep learning-based reduced order models for nonlinear parametrized PDEs | Simone Brivio et.al. | 2405.08558 | null |
2024-05-14 | Pedro De la Torre Luque et.al. | 2405.08482 | null | |
2024-05-13 | Cloaking for random walks using a discrete potential theory | Trent DeGiovanni et.al. | 2405.07961 | link |
2024-05-13 | Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data | Mahdi Morafah et.al. | 2405.07925 | null |
2024-05-13 | CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models | Nick Stracke et.al. | 2405.07913 | null |
2024-05-13 | Latest results from Super-Kamiokande | Andrew D. Santos et.al. | 2405.07900 | null |
2024-05-13 | Improving Breast Cancer Grade Prediction with Multiparametric MRI Created Using Optimized Synthetic Correlated Diffusion Imaging | Chi-en Amy Tai et.al. | 2405.07861 | null |
2024-05-13 | Radiogenomic biomarkers for immunotherapy in glioblastoma: A systematic review of magnetic resonance imaging studies | Prajwal Ghimire et.al. | 2405.07858 | null |
2024-05-13 | Using Multiparametric MRI with Optimized Synthetic Correlated Diffusion Imaging to Enhance Breast Cancer Pathologic Complete Response Prediction | Chi-en Amy Tai et.al. | 2405.07854 | null |
2024-05-13 | SAR Image Synthesis with Diffusion Models | Denisa Qosja et.al. | 2405.07776 | null |
2024-05-13 | LGDE: Local Graph-based Dictionary Expansion | Dominik J. Schindler et.al. | 2405.07764 | link |
2024-05-13 | FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation | Jianyi Chen et.al. | 2405.07682 | null |
2024-05-10 | OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation | Jinwei Lin et.al. | 2405.06547 | link |
2024-05-10 | Controllable Image Generation With Composed Parallel Token Prediction | Jamie Stirling et.al. | 2405.06535 | null |
2024-05-10 | SketchDream: Sketch-based Text-to-3D Generation and Editing | Feng-Lin Liu et.al. | 2405.06461 | null |
2024-05-10 | A universal phenomenology of charge-spin interconversion and dynamics in diffusive systems with spin-orbit coupling | Tim Kokkeler et.al. | 2405.06334 | null |
2024-05-10 | PUMA: margin-based data pruning | Javier Maroto et.al. | 2405.06298 | null |
2024-05-10 | Green's Function and Pointwise Space-time Behaviors of the Three-Dimensional Relativistic Boltzmann Equation | Yanchao Li et.al. | 2405.06280 | null |
2024-05-10 | Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging | Zhuchen Shao et.al. | 2405.06175 | null |
2024-05-10 | Integrability-preserving regularizations of Laplacian Growth | Razvan Teodorescu et.al. | 2405.06167 | null |
2024-05-10 | Dispersivity calculation in digital twins of multiscale porous materials using the micro-continuum approach | Julien Maes et.al. | 2405.06155 | null |
2024-05-09 | Modelling the random spreading of fake news through a two-dimensional time-inhomogeneous birth-death process | Antonio Di Crescenzo et.al. | 2405.06123 | null |
2024-05-09 | Distilling Diffusion Models into Conditional GANs | Minguk Kang et.al. | 2405.05967 | null |
2024-05-09 | Towards comprehensive coverage of chemical space: Quantum mechanical properties of 836k constitutional and conformational closed shell neutral isomers consisting of HCNOFSiPSClBr | Danish Khan et.al. | 2405.05961 | null |
2024-05-09 | Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask | Zineb Senane et.al. | 2405.05959 | link |
2024-05-09 | Frame Interpolation with Consecutive Brownian Bridge Diffusion | Zonglin Lyu et.al. | 2405.05953 | link |
2024-05-09 | Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers | Peng Gao et.al. | 2405.05945 | link |
2024-05-09 | Composable Part-Based Manipulation | Weiyu Liu et.al. | 2405.05876 | null |
2024-05-09 | Parameter identification for an uncertain reaction-diffusion equation via setpoint regulation | Gildas Besançon et.al. | 2405.05866 | null |
2024-05-09 | Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control | Gunshi Gupta et.al. | 2405.05852 | link |
2024-05-09 | Could It Be Generated? Towards Practical Analysis of Memorization in Text-To-Image Diffusion Models | Zhe Ma et.al. | 2405.05846 | link |
2024-05-09 | MSDiff: Multi-Scale Diffusion Model for Ultra-Sparse View CT Reconstruction | Pinhuang Tan et.al. | 2405.05814 | null |
2024-05-08 | Diffusion-HMC: Parameter Inference with Diffusion Model driven Hamiltonian Monte Carlo | Nayantara Mudur et.al. | 2405.05255 | link |
2024-05-08 | Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models | Hongjie Wang et.al. | 2405.05252 | null |
2024-05-08 | Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation | Jonas Kohler et.al. | 2405.05224 | null |
2024-05-08 | FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models | Jinglin Xu et.al. | 2405.05216 | link |
2024-05-08 | An adaptive finite element multigrid solver using GPU acceleration | Manuel Liebchen et.al. | 2405.05047 | null |
2024-05-08 | Reviewing Intelligent Cinematography: AI research for camera-based video production | Adrian Azzarelli et.al. | 2405.05039 | null |
2024-05-08 | Monitoring of neoadjuvant chemotherapy through time domain diffuse optics: Breast tissue composition changes and collagen discriminative potential | Nikhitha Mule et.al. | 2405.05035 | null |
2024-05-08 | An anti-noise seismic inversion method based on diffusion model | Yingtian Liu et.al. | 2405.05026 | link |
2024-05-08 | Stochastic spatial Lotka-Volterra predator-prey models | Uwe C. Täuber et.al. | 2405.05006 | null |
2024-05-08 | A unified theory of the self-similar supersonic Marshak wave problem | Menahem Krief et.al. | 2405.04981 | null |
2024-05-07 | Tactile-Augmented Radiance Fields | Yiming Dou et.al. | 2405.04534 | link |
2024-05-07 | Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing | Yi Zuo et.al. | 2405.04496 | null |
2024-05-07 | CloudDiff: Super-resolution ensemble retrieval of cloud properties for all day using the generative diffusion model | Haixia Xiao et.al. | 2405.04483 | null |
2024-05-07 | Derivation of kinetic and diffusion equations from a hard-sphere Rayleigh gas using collision trees and semigroups | Karsten Matthies et.al. | 2405.04449 | null |
2024-05-07 | Brownian Motion on The Spider Like Quantum Graphs | Madhumita Paul et.al. | 2405.04439 | null |
2024-05-07 | Learning local Dirichlet-to-Neumann maps of nonlinear elliptic PDEs with rough coefficients | Miranda Boutilier et.al. | 2405.04433 | null |
2024-05-07 | Josephson threshold detector in the phase diffusion regime | Dmitry A. Ladeynov et.al. | 2405.04426 | null |
2024-05-07 | Mathematical Modeling of $^{18}$F-Fluoromisonidazole ( |
Mohammad Amin Abazari et.al. | 2405.04418 | null |
2024-05-07 | Community Detection for Heterogeneous Multiple Social Networks | Ziqing Zhu et.al. | 2405.04371 | null |
2024-05-07 | Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos | Junyi Ma et.al. | 2405.04370 | link |
2024-05-06 | An Empty Room is All We Want: Automatic Defurnishing of Indoor Panoramas | Mira Slavcheva et.al. | 2405.03682 | null |
2024-05-06 | Field-of-View Extension for Diffusion MRI via Deep Generative Models | Chenyu Gao et.al. | 2405.03652 | null |
2024-05-06 | Cosine Annealing Optimized Denoising Diffusion Error Correction Codes | Congyang Ou et.al. | 2405.03638 | null |
2024-05-06 | Strang Splitting for Parametric Inference in Second-order Stochastic Differential Equations | Predrag Pilipovic et.al. | 2405.03606 | null |
2024-05-06 | Dissipative gradient nonlinearities prevent |
Tongxing Li et.al. | 2405.03586 | null |
2024-05-06 | Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion models | Ludwig Winkler et.al. | 2405.03549 | null |
2024-05-06 | CCDM: Continuous Conditional Diffusion Models for Image Generation | Xin Ding et.al. | 2405.03546 | link |
2024-05-06 | Asymptotic-preserving hybridizable discontinuous Galerkin method for the Westervelt quasilinear wave equation | Sergio Gómez et.al. | 2405.03535 | null |
2024-05-06 | Quasi-Monte Carlo for Bayesian design of experiment problems governed by parametric PDEs | Vesa Kaarnioja et.al. | 2405.03529 | null |
2024-05-06 | On anomalous dissipation induced by transport noise | Antonio Agresti et.al. | 2405.03525 | null |
2024-05-03 | DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos | Wen-Hsuan Chu et.al. | 2405.02280 | link |
2024-05-03 | Relic gravitons and non-stationary processes | Massimo Giovannini et.al. | 2405.02193 | null |
2024-05-03 | Tangentially Active Polymers in Cylindrical Channels | José Martín-Roca et.al. | 2405.02192 | null |
2024-05-03 | Characterized Diffusion and Spatial-Temporal Interaction Network for Trajectory Prediction in Autonomous Driving | Haicheng Liao et.al. | 2405.02145 | null |
2024-05-03 | Global regularity and infinite Prandtl number limit of temperature patches for the 2D Boussinesq system | Omar Lazar et.al. | 2405.02137 | null |
2024-05-03 | Multi-grid reaction-diffusion master equation: applications to morphogen gradient modelling | Radek Erban et.al. | 2405.02117 | null |
2024-05-03 | On variable annuities with surrender charges | Tiziano De Angelis et.al. | 2405.02115 | null |
2024-05-03 | Anomalous transport in the quantum East-West kinetically constrained model | Pietro Brighi et.al. | 2405.02102 | null |
2024-05-03 | Radiative and mechanical energies in galaxies I. Contributions of molecular shocks and PDRs in 3C 326 N | J. A. Villa-Vélez et.al. | 2405.02058 | null |
2024-05-03 | The CO-dark molecular gas in the cold HI arc | Gan Luo et.al. | 2405.02055 | null |
2024-05-02 | Customizing Text-to-Image Models with a Single Image Pair | Maxwell Jones et.al. | 2405.01536 | null |
2024-05-02 | The heat equation with time-correlated random potential in d=2: Edwards-Wilkinson fluctuations | Sotirios Kotitsas et.al. | 2405.01519 | null |
2024-05-02 | Effective Lifshitz black holes, hydrodynamics, and transport coefficients in fluid/gravity correspondence | D. C. Moreira et.al. | 2405.01505 | null |
2024-05-02 | LocInv: Localization-aware Inversion for Text-Guided Image Editing | Chuanming Tang et.al. | 2405.01496 | link |
2024-05-02 | Navigating Heterogeneity and Privacy in One-Shot Federated Learning with Diffusion Models | Matias Mendieta et.al. | 2405.01494 | null |
2024-05-02 | StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation | Yupeng Zhou et.al. | 2405.01434 | link |
2024-05-02 | In-and-Out: Algorithmic Diffusion for Sampling Convex Bodies | Yunbum Kook et.al. | 2405.01425 | null |
2024-05-02 | Statistical algorithms for low-frequency diffusion data: A PDE approach | Matteo Giordano et.al. | 2405.01372 | link |
2024-05-02 | On Nanowire Morphological Instability and Pinch-Off by Surface Electromigration | Mikhail Khenner et.al. | 2405.01331 | null |
2024-05-02 | DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines | Ye Tian et.al. | 2405.01248 | null |
2024-05-01 | TexSliders: Diffusion-Based Texture Editing in CLIP Space | Julia Guerrero-Viu et.al. | 2405.00672 | null |
2024-05-01 | RGB |
Zheng Zeng et.al. | 2405.00666 | null |
2024-05-01 | Large deviations of current for the symmetric simple exclusion process on a semi-infinite line and on an infinite line with slow bonds | Kapil Sharma et.al. | 2405.00654 | null |
2024-05-01 | Stochastic fluids with transport noise: Approximating diffusion from data using SVD and ensemble forecast back-propagation | James Woodfield et.al. | 2405.00640 | null |
2024-05-01 | Vacancy-mediated transport and segregation tendencies of solutes in FCC nickel under diffusional creep: A density functional theory study | Shehab Shousha et.al. | 2405.00639 | null |
2024-05-01 | Engine-fed Kilonovae (Mergernovae) -- II. Radiation | Shunke Ai et.al. | 2405.00638 | null |
2024-05-01 | Deep Metric Learning-Based Out-of-Distribution Detection with Synthetic Outlier Exposure | Assefa Seyoum Wahd et.al. | 2405.00631 | null |
2024-05-01 | Hysteresis and Self-Oscillations in an Artificial Memristive Quantum Neuron | Finlay Potter et.al. | 2405.00624 | null |
2024-05-01 | Lane Segmentation Refinement with Diffusion Models | Antonio Ruiz et.al. | 2405.00620 | null |
2024-05-01 | Anomalous diffusion and factor ordering in (1+1)-dimensional Lorentzian quantum gravity | Elijah Sanderson et.al. | 2405.00594 | null |
2024-04-30 | MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model | Wenxun Dai et.al. | 2404.19759 | link |
2024-04-30 | Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting | Paul Engstler et.al. | 2404.19758 | null |
2024-04-30 | Mixed Continuous and Categorical Flow Matching for 3D De Novo Molecule Generation | Ian Dunn et.al. | 2404.19739 | link |
2024-04-30 | Investigating the correlations between IceCube high-energy neutrinos and Fermi-LAT |
Ming-Xuan Lu et.al. | 2404.19730 | null |
2024-04-30 | X-Diffusion: Generating Detailed 3D MRI Volumes From a Single Image Using Cross-Sectional Diffusion Models | Emmanuelle Bourigault et.al. | 2404.19604 | null |
2024-04-30 | Cool-core, X-ray cavities and cold front revealed in RXCJ0352.9+1941 cluster by Chandra and GMRT observations | Satish S. Sonkamble et.al. | 2404.19549 | null |
2024-04-30 | Shocks in the Warm Neutral Medium I -- Theoretical model | Benjamin Godard et.al. | 2404.19533 | null |
2024-04-30 | MicroDreamer: Zero-shot 3D Generation in |
Luxi Chen et.al. | 2404.19525 | link |
2024-04-30 | Well-posedness of McKean-Vlasov SDEs with density-dependent drift | Anh-Dung Le et.al. | 2404.19499 | null |
2024-04-30 | TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models | Teng Zhou et.al. | 2404.19475 | link |
2024-04-29 | Stylus: Automatic Adapter Selection for Diffusion Models | Michael Luo et.al. | 2404.18928 | null |
2024-04-29 | TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation | Junhao Cheng et.al. | 2404.18919 | link |
2024-04-29 | Learning general Gaussian mixtures with efficient score matching | Sitan Chen et.al. | 2404.18893 | null |
2024-04-29 | A Survey on Diffusion Models for Time Series and Spatio-Temporal Data | Yiyuan Yang et.al. | 2404.18886 | link |
2024-04-29 | Learning Mixtures of Gaussians Using Diffusion Models | Khashayar Gatmiry et.al. | 2404.18869 | null |
2024-04-29 | Construction of local reduced spaces for Friedrichs' systems via randomized training | Christian Engwer et.al. | 2404.18839 | null |
2024-04-29 | Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior | Zhiyuan Li et.al. | 2404.18820 | link |
2024-04-29 | Spectral measures and iterative bounds for effective diffusivity of steady and space-time periodic flows | N. B. Murphy et.al. | 2404.18754 | null |
2024-04-29 | Diffuse scattering from dynamically compressed single-crystal zirconium following the pressure-induced |
P. G. Heighway et.al. | 2404.18740 | null |
2024-04-29 | Diffusion coefficient matrix for multiple conserved charges: a Kubo approach | Sourav Dey et.al. | 2404.18718 | null |
2024-04-26 | Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos | Zhengze Xu et.al. | 2404.17571 | null |
2024-04-26 | MaPa: Text-driven Photorealistic Material Painting for 3D Shapes | Shangzhan Zhang et.al. | 2404.17569 | null |
2024-04-26 | [OI] fine structure line profiles in Mon R2 and M17 SW: the puzzling nature of cold foreground material identified by [12CII] self-absorption | C. Guevara et.al. | 2404.17538 | null |
2024-04-26 | Reduction of the effective population size in a branching particle system in the moderate mutation-selection regime | Florin Boenkost et.al. | 2404.17527 | null |
2024-04-26 | Chemotaxis-inspired PDE model for airborne infectious disease transmission: analysis and simulations | Pierluigi Colli et.al. | 2404.17506 | null |
2024-04-26 | TextGaze: Gaze-Controllable Face Generation with Natural Language | Hengfei Wang et.al. | 2404.17486 | null |
2024-04-26 | Consistent Second Moment Methods with Scalable Linear Solvers for Radiation Transport | Samuel Olivier et.al. | 2404.17473 | null |
2024-04-26 | Quasi particle model vs lattice QCD thermodynamics: extension to |
Maria Lucia Sambataro et.al. | 2404.17459 | null |
2024-04-26 | Vaporization dynamics of a super-heated water-in-oil droplet: modeling and numerical solution | Muhammad Saeed Saleem et.al. | 2404.17457 | null |
2024-04-26 | Multi-view Image Prompted Multi-view Diffusion for Improved 3D Generation | Seungwook Kim et.al. | 2404.17419 | null |
2024-04-25 | The Third Monocular Depth Estimation Challenge | Jaime Spencer et.al. | 2404.16831 | null |
2024-04-25 | Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials | Ye Fang et.al. | 2404.16829 | null |
2024-04-25 | ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving | Jiehui Huang et.al. | 2404.16771 | link |
2024-04-25 | Analysis of Ethanol Blending Effects on Auto-Ignition and Heat Release in n-Heptane/Ethanol Non-Premixed Flames | Liang Ji et.al. | 2404.16762 | null |
2024-04-25 | Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior | Han Wang et.al. | 2404.16678 | null |
2024-04-25 | The First Estimation of the Ambipolar Diffusivity Coefficient from Multi-Scale Observations of the Class 0/I Protostar, HOPS-370 | Travis J. Thieme et.al. | 2404.16668 | null |
2024-04-25 | Inferring solid-state diffusivity in lithium-ion battery active materials: improving upon the classical GITT method | A. Emir Gumrukcuoglu et.al. | 2404.16658 | null |
2024-04-25 | Denoising: from classical methods to deep CNNs | Jean-Eric Campagne et.al. | 2404.16617 | link |
2024-04-25 | Stochastic Dissipative Euler's equations for a free body | J. A. de la Torre et.al. | 2404.16613 | null |
2024-04-25 | MuseumMaker: Continual Style Customization without Catastrophic Forgetting | Chenxi Liu et.al. | 2404.16612 | null |
2024-04-24 | Optimizing OOD Detection in Molecular Graphs: A Novel Approach with Diffusion Models | Xu Shen et.al. | 2404.15625 | null |
2024-04-24 | A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution | Zhixiong Yang et.al. | 2404.15620 | link |
2024-04-23 | Measuring topological constraint relaxation in ring-linear polymer blends | Daniel L. Vigil et.al. | 2404.15560 | null |
2024-04-23 | Thermal boundary conductance of sharp metal-diamond interfaces predicted by machine learning molecular dynamics | Khalid Zobaid Adnan et.al. | 2404.15465 | null |
2024-04-23 | ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning | Weifeng Chen et.al. | 2404.15449 | null |
2024-04-23 | GLoD: Composing Global Contexts and Local Details in Image Generation | Moyuru Yamada et.al. | 2404.15447 | null |
2024-04-23 | Thermal boundary conductance and thermal conductivity strongly depend on nearby environment | Khalid Zobaid Adnan et.al. | 2404.15439 | null |
2024-04-23 | ID-Animator: Zero-Shot Identity-Preserving Human Video Generation | Xuanhua He et.al. | 2404.15275 | link |
2024-04-23 | From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation | Zehuan Huang et.al. | 2404.15267 | link |
2024-04-23 | Score matching for sub-Riemannian bridge sampling | Erlend Grong et.al. | 2404.15258 | null |
2024-04-23 | Nucleation mechanism of multiple-order parameter ferroelectric domain wall motion in hafnia | Songsong Zhou et.al. | 2404.15251 | null |
2024-04-23 | Local well-posedness for a novel nonlocal model for cell-cell adhesion via receptor binding | Mabel Lizzy Rajendran et.al. | 2404.15222 | null |
2024-04-23 | Heat flow, log-concavity, and Lipschitz transport maps | Giovanni Brigati et.al. | 2404.15205 | null |
2024-04-23 | Signature of Particle Diffusion on the X-ray Spectra of the blazar Mkn 421 | C. Baheeja et.al. | 2404.15171 | null |
2024-04-23 | A general multi-wave quasi-resonance theory for lattice energy diffusion | Wei Lin et.al. | 2404.15147 | null |
2024-04-23 | CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method | Mingbao Lin et.al. | 2404.15141 | link |
2024-04-23 | Taming Diffusion Probabilistic Models for Character Control | Rui Chen et.al. | 2404.15121 | null |
2024-04-22 | Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses | Inhee Lee et.al. | 2404.14410 | null |
2024-04-22 | GeoDiffuser: Geometry-Based Image Editing with Diffusion Models | Rahul Sajnani et.al. | 2404.14403 | null |
2024-04-22 | Observational characterisation of large-scale transport and horizontal turbulent diffusivity in the quiet Sun | F. Rincon et.al. | 2404.14383 | null |
2024-04-22 | TAVGBench: Benchmarking Text to Audible-Video Generation | Yuxin Mao et.al. | 2404.14381 | link |
2024-04-22 | Temporal Entanglement Profiles in Dual-Unitary Clifford Circuits with Measurements | Jiangtian Yao et.al. | 2404.14374 | null |
2024-04-22 | Operando Analysis of Adsorption-Limited Hydrogen Oxidation Reaction at Palladium Surfaces | Yukun Liu et.al. | 2404.14348 | null |
2024-04-22 | Full Event Particle-Level Unfolding with Variable-Length Latent Variational Diffusion | Alexander Shmakov et.al. | 2404.14332 | null |
2024-04-22 | X-Ray: A Sequential 3D Representation for Generation | Tao Hu et.al. | 2404.14329 | link |
2024-04-22 | Towards Better Adversarial Purification via Adversarial Denoising Diffusion Training | Yiming Liu et.al. | 2404.14309 | null |
2024-04-22 | Collaborative Filtering Based on Diffusion Models: Unveiling the Potential of High-Order Connectivity | Yu Hou et.al. | 2404.14240 | link |
2024-04-19 | Analysis of Classifier-Free Guidance Weight Schedulers | Xi Wang et.al. | 2404.13040 | null |
2024-04-19 | A multigrain-multilayer astrochemical model with variable desorption energy for surface species | Juris Kalvans et.al. | 2404.13011 | null |
2024-04-19 | RadRotator: 3D Rotation of Radiographs with Diffusion Models | Pouria Rouzrokh et.al. | 2404.13000 | null |
2024-04-19 | Cross-modal Diffusion Modelling for Super-resolved Spatial Transcriptomics | Xiaofei Wang et.al. | 2404.12973 | null |
2024-04-19 | On the McKean-Vlasov SDE with branching | Julien Claisse et.al. | 2404.12964 | null |
2024-04-19 | Robust hybrid finite element methods for reaction-dominated diffusion problems | Thomas Führer et.al. | 2404.12956 | null |
2024-04-19 | Neural Flow Diffusion Models: Learnable Forward Process for Improved Diffusion Modelling | Grigory Bartosh et.al. | 2404.12940 | null |
2024-04-19 | Diffusive contact between randomly driven colloidal suspensions | Galor Geva et.al. | 2404.12929 | null |
2024-04-19 | Zero-Shot Medical Phrase Grounding with Off-the-shelf Diffusion Models | Konstantinos Vilouras et.al. | 2404.12920 | null |
2024-04-19 | Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images | Santosh et.al. | 2404.12908 | link |
2024-04-18 | G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis | Yufei Ye et.al. | 2404.12383 | null |
2024-04-18 | Lazy Diffusion Transformer for Interactive Image Editing | Yotam Nitzan et.al. | 2404.12382 | null |
2024-04-18 | Learning the Domain Specific Inverse NUFFT for Accelerated Spiral MRI using Diffusion Models | Trevor J. Chan et.al. | 2404.12361 | null |
2024-04-18 | AniClipart: Clipart Animation with Text-to-Video Priors | Ronghuan Wu et.al. | 2404.12347 | null |
2024-04-18 | Customizing Text-to-Image Diffusion with Camera Viewpoint Control | Nupur Kumari et.al. | 2404.12333 | null |
2024-04-18 | Guided Discrete Diffusion for Electronic Health Record Generation | Zixiang Chen et.al. | 2404.12314 | null |
2024-04-18 | Investigation of Spin-Pumping and -Transport in the Ni80Fe20/Pt/Co Asymmetric Trilayer | Shilpa Samdani et.al. | 2404.12307 | null |
2024-04-18 | RISE: 3D Perception Makes Real-World Robot Imitation Simple and Effective | Chenxi Wang et.al. | 2404.12281 | null |
2024-04-18 | A New Computational Method for Energetic Particle Acceleration and Transport with its Feedback | Jeongbhin Seo et.al. | 2404.12276 | null |
2024-04-18 | Tree-Based Nonlinear Reduced Modeling | Diane Guignard et.al. | 2404.12262 | null |
2024-04-17 | Factorized Diffusion: Perceptual Illusions by Noise Decomposition | Daniel Geng et.al. | 2404.11615 | null |
2024-04-17 | InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior | Zhiheng Liu et.al. | 2404.11613 | null |
2024-04-17 | IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination | Xi Chen et.al. | 2404.11593 | null |
2024-04-17 | Prompt Optimizer of Text-to-Image Diffusion Models for Abstract Concept Understanding | Zezhong Fan et.al. | 2404.11589 | null |
2024-04-17 | Emulators for scarce and noisy data: application to auxiliary field diffusion Monte Carlo for the deuteron | Rahul Somasundaram et.al. | 2404.11566 | null |
2024-04-17 | MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation | Kuan-Chieh et.al. | 2404.11565 | null |
2024-04-17 | Predicting Long-horizon Futures by Conditioning on Geometry and Time | Tarasha Khurana et.al. | 2404.11554 | null |
2024-04-17 | A Bayesian level-set inversion method for simultaneous reconstruction of absorption and diffusion coefficients in diffuse optical tomography | Anuj Abhishek et.al. | 2404.11552 | null |
2024-04-17 | SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening | Yu Zhong et.al. | 2404.11537 | null |
2024-04-17 | Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt | Zhanjie Zhang et.al. | 2404.11474 | link |
2024-04-16 | Searching for cold gas traced by MgII quasar absorbers in massive X-ray-selected galaxy clusters | A. Y. Fresco et.al. | 2404.10773 | null |
2024-04-16 | RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting | Ashkan Mirzaei et.al. | 2404.10765 | null |
2024-04-16 | LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? | Yuchi Wang et.al. | 2404.10763 | link |
2024-04-16 | A High-Order Conservative Cut Finite Element Method for Problems in Time-Dependent Domains | Sebastian Myrbäck et.al. | 2404.10756 | link |
2024-04-16 | GazeHTA: End-to-end Gaze Target Detection with Head-Target Association | Zhi-Yi Lin et.al. | 2404.10718 | null |
2024-04-16 | Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution | Yutao Yuan et.al. | 2404.10688 | link |
2024-04-16 | Generating Human Interaction Motions in Scenes with Text Control | Hongwei Yi et.al. | 2404.10685 | null |
2024-04-16 | StyleCity: Large-Scale 3D Urban Scenes Stylization with Vision-and-Text Reference via Progressive Optimization | Yingshu Chen et.al. | 2404.10681 | null |
2024-04-16 | Arsenic diffusion in MOVPE-Grown GaAs/Ge epitaxial structures | V. Orejuela et.al. | 2404.10669 | null |
2024-04-16 | Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay | Jinmei Liu et.al. | 2404.10662 | link |
2024-04-15 | Accurate quantum Monte Carlo forces for machine-learned force fields: Ethanol as a benchmark | Emiel Slootman et.al. | 2404.09755 | null |
2024-04-15 | Electric potential during tokamak disruptions and steady-state current drive | Allen H Boozer et.al. | 2404.09744 | null |
2024-04-15 | Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement | Wenyi Lian et.al. | 2404.09735 | link |
2024-04-15 | Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models | Ziwei Luo et.al. | 2404.09732 | link |
2024-04-15 | Structure and dynamics of active string fluids and gels formed by dipolar active Brownian particles | Maria Kelidou et.al. | 2404.09693 | null |
2024-04-15 | Deformable MRI Sequence Registration for AI-based Prostate Cancer Diagnosis | Alessa Hering et.al. | 2404.09666 | null |
2024-04-15 | Impact of chirality on active Brownian particle: Exact moments in two and three dimensions | Anweshika Pattanayak et.al. | 2404.09650 | null |
2024-04-15 | All-in-one simulation-based inference | Manuel Gloeckler et.al. | 2404.09636 | link |
2024-04-15 | Branching diffusion processes and spectral properties of Feynman-Kac semigroup | Pierre Collet et.al. | 2404.09568 | null |
2024-04-15 | Entropy on the Path Space and Application to Singular Diffusions and Mean-field Models | Patrick Cattiaux et.al. | 2404.09552 | null |
2024-04-15 | Turbulent ice-ocean boundary layers in the well-mixed regime: insights from direct numerical simulations | Louis-Alexandre Couston et.al. | 2404.09545 | null |
2024-04-12 | Lossy Image Compression with Foundation Diffusion Models | Lucas Relic et.al. | 2404.08580 | null |
2024-04-12 | Functional reducibility of higher-order networks | Maxime Lucas et.al. | 2404.08547 | link |
2024-04-12 | Echoes of darkness: Supernova-neutrino-boosted dark matter from all galaxies | Yen-Hsun Lin et.al. | 2404.08528 | link |
2024-04-12 | Generalized Hydrodynamics for the Volterra lattice: Ballistic and nonballistic behavior of correlation functions | Guido Mazzuca et.al. | 2404.08499 | null |
2024-04-12 | PiRD: Physics-informed Residual Diffusion for Flow Field Reconstruction | Siming Shan et.al. | 2404.08412 | null |
2024-04-12 | Estimate of force noise from electrostatic patch potentials in LISA Pathfinder | Stefano Vitale et.al. | 2404.08340 | null |
2024-04-12 | Struggle with Adversarial Defense? Try Diffusion | Yujie Li et.al. | 2404.08273 | link |
2024-04-12 | An XRISM observation proposal: Gas velocity in the merging cluster Abell 2256 | Takayuki Tamura et.al. | 2404.08267 | null |
2024-04-12 | Balanced Mixed-Type Tabular Data Synthesis with Diffusion Models | Zeyu Yang et.al. | 2404.08254 | link |
2024-04-12 | An Asymptotically-Correct Implicit-Explicit Time Integration Scheme for Finite Volume Radiation-Hydrodynamics | Chong-Chong He et.al. | 2404.08247 | link |
2024-04-11 | OpenBias: Open-set Bias Detection in Text-to-Image Generative Models | Moreno D'Incà et.al. | 2404.07990 | link |
2024-04-11 | ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback | Ming Li et.al. | 2404.07987 | link |
2024-04-11 | View Selection for 3D Captioning via Diffusion Ranking | Tiange Luo et.al. | 2404.07984 | null |
2024-04-11 | Taming Stable Diffusion for Text to 360° Panorama Image Generation | Cheng Zhang et.al. | 2404.07949 | link |
2024-04-11 | Active Carpets in floating viscous films | Felipe A. Barros et.al. | 2404.07856 | null |
2024-04-11 | Adaptive Hyperbolic-cross-space Mapped Jacobi Method on Unbounded Domains with Applications to Solving Multidimensional Spatiotemporal Integrodifferential Equations | Yunhong Deng et.al. | 2404.07844 | null |
2024-04-11 | The Cattaneo-Christov approximation of Fourier heat-conductive compressible fluids | Timothée Crin-Barat et.al. | 2404.07809 | null |
2024-04-11 | ConsistencyDet: Robust Object Detector with Denoising Paradigm of Consistency Model | Lifan Jiang et.al. | 2404.07773 | link |
2024-04-11 | An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization | Minshuo Chen et.al. | 2404.07771 | null |
2024-04-11 | Joint Conditional Diffusion Model for Image Restoration with Mixed Degradations | Yufeng Yue et.al. | 2404.07770 | null |
2024-04-10 | GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models | Zewei Zhang et.al. | 2404.07206 | null |
2024-04-10 | RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion | Jaidev Shriram et.al. | 2404.07199 | null |
2024-04-10 | InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models | Jiale Xu et.al. | 2404.07191 | link |
2024-04-10 | Move Anything with Layered Scene Diffusion | Jiawei Ren et.al. | 2404.07178 | null |
2024-04-10 | Understanding Dynamics in Coarse-Grained Models: IV. Connection of Fine-Grained and Coarse-Grained Dynamics with the Stokes-Einstein and Stokes-Einstein-Debye Relations | Jaehyeok Jin et.al. | 2404.07156 | null |
2024-04-10 | A conservative Eulerian finite element method for transport and diffusion in moving domains | Maxim Olshanskii et.al. | 2404.07130 | link |
2024-04-10 | Open reaction-diffusion systems: bridging probabilistic theory across scales | Mauricio J. del Razo et.al. | 2404.07119 | null |
2024-04-10 | Diffusion-based inpainting of incomplete Euclidean distance matrices of trajectories generated by a fractional Brownian motion | Alexander Lobashev et.al. | 2404.07029 | link |
2024-04-10 | On the conjugate interface conditions and Galilean invariance | Yang Hu et.al. | 2404.07025 | null |
2024-04-10 | Non-Degenerate One-Time Pad and the integrity of perfectly secret messages | Alex Shafarenko et.al. | 2404.07022 | null |
2024-04-09 | Convergence analysis of novel discontinuous Galerkin methods for a convection dominated problem | Satyajith Bommana Boyana et.al. | 2404.06490 | null |
2024-04-09 | Uncovering Tidal Treasures: Automated Classification of Faint Tidal Features in DECaLS Data | Alexander J. Gordon et.al. | 2404.06487 | link |
2024-04-09 | GeoDirDock: Guiding Docking Along Geodesic Paths | Raúl Miñán et.al. | 2404.06481 | null |
2024-04-09 | Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion | Fan Yang et.al. | 2404.06429 | link |
2024-04-09 | ZeST: Zero-Shot Material Transfer from a Single Image | Ta-Ying Cheng et.al. | 2404.06425 | null |
2024-04-09 | Policy-Guided Diffusion | Matthew Thomas Jackson et.al. | 2404.06356 | link |
2024-04-09 | Quantum State Generation with Structure-Preserving Diffusion Model | Yuchen Zhu et.al. | 2404.06336 | null |
2024-04-09 | Compensating slice emittance growth in high brightness photoinjectors using sacrificial charge | W. H. Li et.al. | 2404.06312 | null |
2024-04-09 | NoiseNCA: Noisy Seed Improves Spatio-Temporal Continuity of Neural Cellular Automata | Ehsan Pajouheshgar et.al. | 2404.06279 | null |
2024-04-09 | A Large-Scale Simulation Method for Neuromorphic Circuits | Amir Shahhosseini et.al. | 2404.06255 | null |
2024-04-08 | The neutrino background from non-jetted active galactic nuclei | P. Padovani et.al. | 2404.05690 | null |
2024-04-08 | MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Kunpeng Song et.al. | 2404.05674 | link |
2024-04-08 | NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement | Giordano Cicchetti et.al. | 2404.05669 | link |
2024-04-08 | YaART: Yet Another ART Rendering Technology | Sergey Kastryulin et.al. | 2404.05666 | null |
2024-04-08 | BinaryDM: Towards Accurate Binarization of Diffusion Model | Xingyu Zheng et.al. | 2404.05662 | link |
2024-04-08 | Convergence rates for the finite volume scheme of the stochastic heat equation | Niklas Sapountzoglou et.al. | 2404.05655 | null |
2024-04-09 | The persistence of high altitude non-equilibrium diffuse ionized gas in simulations of star forming galaxies | Lewis McCallum et.al. | 2404.05651 | null |
2024-04-08 | Resistive Memory-based Neural Differential Equation Solver for Score-based Diffusion Model | Jichang Yang et.al. | 2404.05648 | link |
2024-04-08 | eDIG-CHANGES II: Project Design and Initial Results on NGC 3556 | Jiang-Tao Li et.al. | 2404.05628 | null |
2024-04-08 | Learning a Category-level Object Pose Estimator without Pose Annotations | Fengrui Tian et.al. | 2404.05626 | null |
2024-04-05 | Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models | Sangwon Jang et.al. | 2404.04243 | null |
2024-04-05 | ToolEENet: Tool Affordance 6D Pose Estimation | Yunlong Wang et.al. | 2404.04193 | null |
2024-04-05 | Nonlocally coupled moisture model for convective self-aggregation | Tomoro Yanase et.al. | 2404.04146 | null |
2024-04-05 | Rare events, time crystals and symmetry-breaking dynamical phase transitions | Rubén Hurtado-Gutiérrez et.al. | 2404.04135 | null |
2024-04-05 | A posteriori error analysis of a space-time hybridizable discontinuous Galerkin method for the advection-diffusion problem | Yuan Wang et.al. | 2404.04130 | null |
2024-04-05 | Dynamic Prompt Optimizing for Text-to-Image Generation | Wenyi Mo et.al. | 2404.04095 | link |
2024-04-05 | A first passage model of intravitreal drug delivery and residence time, in relation to ocular geometry, individual variability, and injection location | Patricia Lamirande et.al. | 2404.04086 | null |
2024-04-05 | Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation | Mingyuan Zhou et.al. | 2404.04057 | link |
2024-04-05 | InstructHumans: Editing Animated 3D Human Textures with Instructions | Jiayin Zhu et.al. | 2404.04037 | null |
2024-04-05 | Impacts of non-thermal emission on the images of black hole shadow and extended jets in two-temperature GRMHD simulations | Mingyuan Zhang et.al. | 2404.04033 | null |
2024-04-04 | MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation | Hanzhe Hu et.al. | 2404.03656 | null |
2024-04-04 | CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching | Dongzhi Jiang et.al. | 2404.03653 | link |
2024-04-04 | The More You See in 2D, the More You Perceive in 3D | Xinyang Han et.al. | 2404.03652 | null |
2024-04-04 | DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior | Yiming Zhang et.al. | 2404.03642 | null |
2024-04-04 | LCM-Lookahead for Encoder-based Text-to-Image Personalization | Rinon Gal et.al. | 2404.03620 | null |
2024-04-04 | DiffDet4SAR: Diffusion-based Aircraft Target Detection Network for SAR Images | Zhou Jie et.al. | 2404.03595 | link |
2024-04-04 | PointInfinity: Resolution-Invariant Point Diffusion Models | Zixuan Huang et.al. | 2404.03566 | null |
2024-04-04 | Segmentation-Guided Knee Radiograph Generation using Conditional Diffusion Models | Siyuan Mei et.al. | 2404.03541 | null |
2024-04-04 | Impact of the Magnetic Horizon on the Interpretation of the Pierre Auger Observatory Spectrum and Composition Data | The Pierre Auger Collaboration et.al. | 2404.03533 | null |
2024-04-04 | Significantly Enhanced Vacancy Diffusion in Mn-containing Alloys | Huaqing Guan et.al. | 2404.03339 | null |
2024-04-03 | Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | Keyu Tian et.al. | 2404.02905 | link |
2024-04-03 | LidarDM: Generative LiDAR Simulation in a Generated World | Vlas Zyrianov et.al. | 2404.02903 | link |
2024-04-03 | MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment | Duygu Ceylan et.al. | 2404.02899 | null |
2024-04-03 | On the Scalability of Diffusion-based Text-to-Image Generation | Hao Li et.al. | 2404.02883 | null |
2024-04-03 | Uniqueness of the blow-down limit for triple junction problem | Zhiyuan Geng et.al. | 2404.02859 | null |
2024-04-03 | Efficient Quantum Circuits for Non-Unitary and Unitary Diagonal Operators with Space-Time-Accuracy trade-offs | Julien Zylberman et.al. | 2404.02819 | null |
2024-04-03 | Fast Diffusion Model For Seismic Data Noise Attenuation | Junheng Peng et.al. | 2404.02767 | null |
2024-04-03 | Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models | Wentian Zhang et.al. | 2404.02747 | link |
2024-04-03 | InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation | Haofan Wang et.al. | 2404.02733 | link |
2024-04-03 | Harnessing the Power of Large Vision Language Models for Synthetic Image Detection | Mamadou Keita et.al. | 2404.02726 | link |
2024-04-02 | Diffusion |
Zeyu Yang et.al. | 2404.02148 | link |
2024-04-02 | A Stabilized Parametric Finite Element Method for Surface Diffusion with an Arbitrary Surface Energy | Yulin Zhang et.al. | 2404.02083 | null |
2024-04-02 | WcDT: World-centric Diffusion Transformer for Traffic Scene Generation | Chen Yang et.al. | 2404.02082 | link |
2024-04-02 | Brownian Particles and Matter Waves | Nicos Makris et.al. | 2404.02016 | null |
2024-04-02 | Superionic Fluoride Gate Dielectrics with Low Diffusion Barrier for Advanced Electronics | Kui Meng et.al. | 2404.02011 | null |
2024-04-02 | AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design | Xinze Li et.al. | 2404.02003 | null |
2024-04-02 | Rigorous derivation of an effective model for coupled Stokes advection, reaction and diffusion with freely evolving microstructure | Markus Gahn et.al. | 2404.01983 | null |
2024-04-02 | Bi-LORA: A Vision-Language Approach for Synthetic Image Detection | Mamadou Keita et.al. | 2404.01959 | link |
2024-04-02 | Nonlinear stability for active suspensions | Helge Dietert et.al. | 2404.01906 | null |
2024-04-02 | On the surface helium abundance of B-type hot subdwarf stars from the WD+MS channel of Type Ia supernovae | Rui-Jie Ji et.al. | 2404.01905 | null |
2024-03-29 | Relation Rectification in Diffusion Model | Yinwei Wu et.al. | 2403.20249 | null |
2024-03-29 | Graph Neural Aggregation-diffusion with Metastability | Kaiyuan Cui et.al. | 2403.20221 | null |
2024-03-29 | Scaled Brownian motion with random anomalous diffusion exponent | Hubert Woszczek et.al. | 2403.20206 | null |
2024-03-29 | Motion Inversion for Video Customization | Luozhou Wang et.al. | 2403.20193 | null |
2024-03-29 | Energy solutions of the Cauchy-Dirichlet problem for fractional nonlinear diffusion equations | Goro Akagi et.al. | 2403.20176 | null |
2024-03-29 | Na Vacancy Driven Phase Transformation and Fast Ion Conduction in W-doped Na $_3$SbS$_4$ from Machine Learning Force Fields | Johan Klarbring et.al. | 2403.20138 | null |
2024-03-29 | FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models | Barbara Toniella Corradini et.al. | 2403.20105 | null |
2024-03-29 | SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior | Zhongrui Yu et.al. | 2403.20079 | null |
2024-03-29 | Efficacy of the Sterile Insect Technique in the presence of inaccessible areas: A study using two-patch models | Pierre-Alexandre Bliman et.al. | 2403.20069 | null |
2024-03-29 | Optimal s-boxes against alternative operations | Marco Calderini et.al. | 2403.20059 | null |
2024-03-28 | GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling | Bowen Zhang et.al. | 2403.19655 | null |
2024-03-28 | Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond | Katherine Xu et.al. | 2403.19653 | link |
2024-03-28 | InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction | Sirui Xu et.al. | 2403.19652 | null |
2024-03-28 | GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models | Yusuf Dalva et.al. | 2403.19645 | null |
2024-03-28 | In the driver's mind: modeling the dynamics of human overtaking decisions in interactions with oncoming automated vehicles | Samir H. A. Mohammad et.al. | 2403.19637 | null |
2024-03-28 | Generalisation of the Spectral Difference scheme for the diffused-interface five equation model | Niccolò Tonicello et.al. | 2403.19623 | null |
2024-03-28 | More on Black Holes Perceiving the Dark Dimension | Luis A. Anchordoqui et.al. | 2403.19604 | null |
2024-03-28 | Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model | Zhicai Wang et.al. | 2403.19600 | link |
2024-03-28 | Frame by Familiar Frame: Understanding Replication in Video Diffusion Models | Aimon Rahman et.al. | 2403.19593 | null |
2024-03-28 | Keypoint Action Tokens Enable In-Context Imitation Learning in Robotics | Norman Di Palo et.al. | 2403.19578 | null |
2024-03-27 | ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion | Daniel Winter et.al. | 2403.18818 | null |
2024-03-27 | Garment3DGen: 3D Garment Stylization and Texture Generation | Nikolaos Sarafianos et.al. | 2403.18816 | null |
2024-03-28 | ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation | Suraj Patni et.al. | 2403.18807 | link |
2024-03-27 | Dimension-independent functional inequalities by tensorization and projection arguments | Fabrice Baudoin et.al. | 2403.18799 | null |
2024-03-27 | Object Pose Estimation via the Aggregation of Diffusion Features | Tianfu Wang et.al. | 2403.18791 | link |
2024-03-27 | ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object | Chenshuang Zhang et.al. | 2403.18775 | link |
2024-03-27 | Convergence rates under a range invariance condition with application to electrical impedance tomography | Barbara Kaltenbacher et.al. | 2403.18704 | null |
2024-03-27 | A Diffusion-Based Generative Equalizer for Music Restoration | Eloi Moliner et.al. | 2403.18636 | link |
2024-03-28 | FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing | Trong-Tung Nguyen et.al. | 2403.18605 | null |
2024-03-27 | HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions | Hao Xu et.al. | 2403.18575 | link |
2024-03-26 | ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis | Muhammad Hamza Mughal et.al. | 2403.17936 | null |
2024-03-26 | SLEDGE: Synthesizing Simulation Environments for Driving Agents with Generative Models | Kashyap Chitta et.al. | 2403.17933 | link |
2024-03-26 | The instability mechanism of compact multiplanet systems | Caleb Lammers et.al. | 2403.17928 | null |
2024-03-26 | AID: Attention Interpolation of Text-to-Image Diffusion | Qiyuan He et.al. | 2403.17924 | link |
2024-03-26 | Emergent Anomalous Hydrodynamics at Infinite Temperature in a Long-Range XXZ Model | Ang Yang et.al. | 2403.17912 | null |
2024-03-26 | The Solution to an Impulse Control Problem Motivated by Optimal Harvesting | Zhesheng Liu et.al. | 2403.17875 | null |
2024-03-26 | Boosting Diffusion Models with Moving Average Sampling in Frequency Domain | Yurui Qian et.al. | 2403.17870 | null |
2024-03-26 | Universal entropy transport far from equilibrium across the BCS-BEC crossover | Jeffrey Mohan et.al. | 2403.17838 | null |
2024-03-26 | The memory of Rayleigh-Taylor turbulence | S. Thévenin et.al. | 2403.17832 | null |
2024-03-26 | DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions | Sammy Christen et.al. | 2403.17827 | null |
2024-03-25 | Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning | Sicong Pan et.al. | 2403.16803 | link |
2024-03-25 | Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise | Dilum Fernando et.al. | 2403.16790 | null |
2024-03-25 | Diff-Def: Diffusion-Generated Deformation Fields for Conditional Atlases | Sophie Starck et.al. | 2403.16776 | null |
2024-03-25 | Stochastic Inertial Dynamics Via Time Scaling and Averaging | Rodrigo Maulen-Soto et.al. | 2403.16775 | null |
2024-03-25 | Multilevel Modeling as a Methodology for the Simulation of Human Mobility | Luca Serena et.al. | 2403.16745 | null |
2024-03-25 | A Robotic Skill Learning System Built Upon Diffusion Policies and Foundation Models | Nils Ingelhag et.al. | 2403.16730 | null |
2024-03-25 | Improving Diffusion Models's Data-Corruption Resistance using Scheduled Pseudo-Huber Loss | Artem Khrapov et.al. | 2403.16728 | link |
2024-03-25 | The effect of inter-track coupling on H $_2$O$_2$ productions | Ramin Abolfath et.al. | 2403.16722 | null |
2024-03-25 | Phase Transformation in Lithium Niobate-Lithium Tantalate Solid Solutions (LiNb $_{1-x}$Ta$_x$O$_3$ ) | Fatima El Azzouzi et.al. | 2403.16717 | null |
2024-03-25 | The Directionality of Gravitational and Thermal Diffusive Transport in Geologic Fluid Storage | Anna Herring et.al. | 2403.16659 | null |
2024-03-22 | DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data | Hanrong Ye et.al. | 2403.15389 | null |
2024-03-22 | LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis | Kevin Xie et.al. | 2403.15385 | null |
2024-03-22 | Energy-dependent Boosted Dark Matter from Diffuse Supernova Neutrino Background | Anirban Das et.al. | 2403.15367 | null |
2024-03-22 | Ultrasound Imaging based on the Variance of a Diffusion Restoration Model | Yuxin Zhang et.al. | 2403.15316 | link |
2024-03-22 | Controlled Training Data Generation with Diffusion Models | Teresa Yeo et.al. | 2403.15309 | null |
2024-03-22 | Parametric PDE Control with Deep Reinforcement Learning and Differentiable L0-Sparse Polynomial Policies | Nicolò Botteghi et.al. | 2403.15267 | link |
2024-03-22 | Spectral Motion Alignment for Video Motion Transfer using Diffusion Models | Geon Yeong Park et.al. | 2403.15249 | null |
2024-03-22 | Shadow Generation for Composite Image Using Diffusion model | Qingyang Liu et.al. | 2403.15234 | link |
2024-03-22 | Broad Instantaneous Bandwidth Microwave Spectrum Analyzer with a Microfabricated Atomic Vapor Cell | Yongqi Shi et.al. | 2403.15155 | null |
2024-03-22 | Oxygenation of CO and NO on Amorphous Solid Water | Meenu Upadhyay et.al. | 2403.15141 | null |
2024-03-21 | Simplified Diffusion Schrödinger Bridge | Zhicong Tang et.al. | 2403.14623 | link |
2024-03-21 | GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation | Yinghao Xu et.al. | 2403.14621 | link |
2024-03-21 | Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion | Xiang Fan et.al. | 2403.14617 | null |
2024-03-21 | DreamReward: Text-to-3D Generation with Human Preference | Junliang Ye et.al. | 2403.14613 | null |
2024-03-21 | ReNoise: Real Image Inversion Through Iterative Noising | Daniel Garibi et.al. | 2403.14602 | null |
2024-03-21 | Click to Grasp: Zero-Shot Precise Manipulation via Visual Diffusion Descriptors | Nikolaos Tsagkas et.al. | 2403.14526 | null |
2024-03-21 | Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting | Alicia Durrer et.al. | 2403.14499 | link |
2024-03-21 | Periodicity from X-ray sources within the inner Galactic disk | Samaresh Mondal et.al. | 2403.14480 | null |
2024-03-21 | Analysing Diffusion Segmentation for Medical Images | Mathias Öttl et.al. | 2403.14440 | null |
2024-03-21 | Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation | Mathias Öttl et.al. | 2403.14429 | null |
2024-03-20 | On Pretraining Data Diversity for Self-Supervised Learning | Hasan Abed Al Kader Hammoud et.al. | 2403.13808 | link |
2024-03-20 | Editing Massive Concepts in Text-to-Image Diffusion Models | Tianwei Xiong et.al. | 2403.13807 | link |
2024-03-20 | ZigMa: Zigzag Mamba Diffusion Model | Vincent Tao Hu et.al. | 2403.13802 | link |
2024-03-20 | TimeRewind: Rewinding Time with Image-and-Events Video Diffusion | Jingxi Chen et.al. | 2403.13800 | null |
2024-03-20 | DepthFM: Fast Monocular Depth Estimation with Flow Matching | Ming Gui et.al. | 2403.13788 | link |
2024-03-20 | Anomalous diffusion in polydisperse granular gases: Monte Carlo simulations | Anna S. Bodrova et.al. | 2403.13772 | null |
2024-03-20 | Disentangling the anisotropic radio sky: Fisher forecasts for 21cm arrays | Zheng Zhang et.al. | 2403.13768 | null |
2024-03-20 | Statistical estimation of full-sky radio maps from 21cm array visibility data using Gaussian Constrained Realisations | Katrine A. Glasscock et.al. | 2403.13766 | null |
2024-03-20 | Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation | Fu-Yun Wang et.al. | 2403.13745 | link |
2024-03-20 | Probabilistic Forecasting with Stochastic Interpolants and Föllmer Processes | Yifan Chen et.al. | 2403.13724 | null |
2024-03-19 | FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis | Linjiang Huang et.al. | 2403.12963 | link |
2024-03-19 | FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation | Shuai Yang et.al. | 2403.12962 | link |
2024-03-19 | TexTile: A Differentiable Metric for Texture Tileability | Carlos Rodriguez-Pardo et.al. | 2403.12961 | link |
2024-03-19 | GVGEN: Text-to-3D Generation with Volumetric Representation | Xianglong He et.al. | 2403.12957 | null |
2024-03-19 | Zero-Reference Low-Light Enhancement via Physical Quadruple Priors | Wenjing Wang et.al. | 2403.12933 | null |
2024-03-19 | You Only Sample Once: Taming One-Step Text-To-Image Synthesis by Self-Cooperative Diffusion GANs | Yihong Luo et.al. | 2403.12931 | link |
2024-03-19 | Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model | Jiajie Yang et.al. | 2403.12915 | link |
2024-03-19 | H |
I. Busa et.al. | 2403.12872 | null |
2024-03-19 | D-Cubed: Latent Diffusion Trajectory Optimisation for Dexterous Deformable Manipulation | Jun Yamada et.al. | 2403.12861 | null |
2024-03-19 | Generative Enhancement for 3D Medical Images | Lingting Zhu et.al. | 2403.12852 | link |
2024-03-18 | Scaling limit of heavy tailed nearly unstable INAR( |
Yingli Wang et.al. | 2403.11773 | null |
2024-03-18 | Irradiation induced mineral changes of NWA10580 meteorite determined by infrared analysis | I. Gyollai et.al. | 2403.11725 | null |
2024-03-18 | Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models | Emilian Postolache et.al. | 2403.11706 | link |
2024-03-19 | Urban Scene Diffusion through Semantic Occupancy Map | Junge Zhang et.al. | 2403.11697 | null |
2024-03-18 | Narrow absorption lines from intervening material in supernovae I. Measurements and temporal evolution | Santiago González-Gaitán et.al. | 2403.11677 | null |
2024-03-18 | Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection | Julia Wolleb et.al. | 2403.11667 | link |
2024-03-18 | Diffusion-Based Environment-Aware Trajectory Prediction | Theodor Westny et.al. | 2403.11643 | null |
2024-03-18 | Arc2Face: A Foundation Model of Human Faces | Foivos Paraperas Papantoniou et.al. | 2403.11641 | link |
2024-03-18 | Quasinormal Modes of Near-Extremal Electric and Magnetic Black Branes | Swapnil Nitin Shah et.al. | 2403.11640 | null |
2024-03-18 | LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models | Yang Yang et.al. | 2403.11627 | link |
2024-03-15 | Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives | Ronghui Li et.al. | 2403.10518 | link |
2024-03-15 | Active transport of a passive colloid in a bath of run-and-tumble particles | Tanumoy Dhar et.al. | 2403.10508 | null |
2024-03-15 | MusicHiFi: Fast High-Fidelity Stereo Vocoding | Ge Zhu et.al. | 2403.10493 | null |
2024-03-15 | New functional inequalities with applications to the arctan-fast diffusion equation | Rafael Granero-Belinchón et.al. | 2403.10458 | null |
2024-03-15 | Variance sum rule: proofs and solvable models | Ivan Di Terlizzi et.al. | 2403.10442 | null |
2024-03-15 | SculptDiff: Learning Robotic Clay Sculpting from Humans with Goal Conditioned Diffusion Policy | Alison Bartsch et.al. | 2403.10401 | null |
2024-03-15 | Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding | Pengkun Liu et.al. | 2403.10395 | link |
2024-03-15 | Denoising Task Difficulty-based Curriculum for Training Diffusion Models | Jin-Young Kim et.al. | 2403.10348 | null |
2024-03-15 | Optimal Control of Stationary Doubly Diffusive Flows on Two and Three Dimensional Bounded Lipschitz Domains: Numerical Analysis | Jai Tushar et.al. | 2403.10282 | null |
2024-03-15 | Towards Generalizable Deepfake Video Detection with Thumbnail Layout and Graph Reasoning | Yuting Xu et.al. | 2403.10261 | link |
2024-03-14 | SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior | Huan-ang Gao et.al. | 2403.09638 | null |
2024-03-14 | 3D-VLA: A 3D Vision-Language-Action Generative World Model | Haoyu Zhen et.al. | 2403.09631 | null |
2024-03-14 | Generalized Predictive Model for Autonomous Driving | Jiazhi Yang et.al. | 2403.09630 | link |
2024-03-14 | Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation | Fangfu Liu et.al. | 2403.09625 | null |
2024-03-14 | Score-Guided Diffusion for 3D Human Recovery | Anastasis Stathopoulos et.al. | 2403.09623 | link |
2024-03-14 | Explore In-Context Segmentation via Latent Diffusion Models | Chaoyang Wang et.al. | 2403.09616 | null |
2024-03-14 | Generative reconstruction of 3D volume elements for Ti-6Al-4V basketweave microstructure by optimization of CNN-based microstructural descriptors | Vincent Blümer et.al. | 2403.09609 | null |
2024-03-14 | The effect of spatially-varying collision frequency on the development of the Rayleigh-Taylor instability | John Rodman et.al. | 2403.09591 | null |
2024-03-14 | MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models | Zunnan Xu et.al. | 2403.09471 | null |
2024-03-14 | Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing | Wonjun Kang et.al. | 2403.09468 | link |
2024-03-13 | VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis | Enric Corona et.al. | 2403.08764 | null |
2024-03-13 | Spatiotemporal Diffusion Model with Paired Sampling for Accelerated Cardiac Cine MRI | Shihan Qiu et.al. | 2403.08758 | null |
2024-03-13 | Efficient Combinatorial Optimization via Heat Diffusion | Hengyuan Ma et.al. | 2403.08757 | link |
2024-03-13 | Sticky-threshold diffusions, local time approximation and parameter estimation | Alexis Anagnostakis et.al. | 2403.08754 | null |
2024-03-13 | Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI | Shihan Qiu et.al. | 2403.08749 | null |
2024-03-14 | GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing | Jing Wu et.al. | 2403.08733 | link |
2024-03-13 | Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data | Asad Aali et.al. | 2403.08728 | link |
2024-03-13 | Historical Astronomical Diagrams Decomposition in Geometric Primitives | Syrine Kalleli et.al. | 2403.08721 | null |
2024-03-13 | Limits on the OH Molecule in the Smith High Velocity Cloud | Anthony H. Minter et.al. | 2403.08704 | null |
2024-03-13 | Diffusion-based Iterative Counterfactual Explanations for Fetal Ultrasound Image Quality Assessment | Paraskevas Pegios et.al. | 2403.08700 | null |
2024-03-12 | Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation | Shihao Zhao et.al. | 2403.07860 | link |
2024-03-12 | Quantifying and Mitigating Privacy Risks for Tabular Generative Models | Chaoyi Zhu et.al. | 2403.07842 | null |
2024-03-12 | MPCPA: Multi-Center Privacy Computing with Predictions Aggregation based on Denoising Diffusion Probabilistic Model | Guibo Luo et.al. | 2403.07838 | null |
2024-03-12 | Fragmentation of Dense Rotation-Dominated Structures Fed by Collapsing Gravomagneto-Sheetlets and Origin of Misaligned 100 au-Scale Binaries and Multiple Systems | Yisheng Tu et.al. | 2403.07777 | null |
2024-03-13 | SemCity: Semantic Scene Generation with Triplane Diffusion | Jumin Lee et.al. | 2403.07773 | link |
2024-03-12 | A first principles study of the Stark shift effect on the zero-phonon line of the NV center in diamond | Louis Alaerts et.al. | 2403.07771 | null |
2024-03-12 | Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model | Yuxuan Zhang et.al. | 2403.07764 | link |
2024-03-13 | Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion | Dongyang Li et.al. | 2403.07721 | link |
2024-03-12 | SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces | Yuta Oshima et.al. | 2403.07711 | link |
2024-03-12 | Genuine Knowledge from Practice: Diffusion Test-Time Adaptation for Video Adverse Weather Removal | Yijun Yang et.al. | 2403.07684 | link |
2024-03-11 | BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion | Xuan Ju et.al. | 2403.06976 | link |
2024-03-11 | Bayesian Diffusion Models for 3D Shape Reconstruction | Haiyang Xu et.al. | 2403.06973 | null |
2024-03-11 | POD-ROM methods: from a finite set of snapshots to continuous-in-time approximations | Bosco Garcia-Archilla et.al. | 2403.06967 | null |
2024-03-11 | SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data | Jialu Li et.al. | 2403.06952 | null |
2024-03-12 | DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations | Tianhao Qi et.al. | 2403.06951 | link |
2024-03-11 | Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction | Qing Xiao et.al. | 2403.06940 | null |
2024-03-11 | Anderson-Higgs amplitude mode in Josephson junctions | Pierre Vallet et.al. | 2403.06878 | null |
2024-03-11 | Estimation of parameters and local times in a discretely observed threshold diffusion model | Sara Mazzonetto et.al. | 2403.06858 | null |
2024-03-11 | Orbital relaxation length from first-principles scattering calculations | Max Rang et.al. | 2403.06827 | null |
2024-03-11 | A quasilinear Keller-Segel model with saturated discontinuous advection | Maria Gualdani et.al. | 2403.06820 | null |
2024-03-08 | VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models | Yabo Zhang et.al. | 2403.05438 | link |
2024-03-08 | Radiation transport methods in star formation simulations | Richard Wünsch et.al. | 2403.05410 | null |
2024-03-08 | Simulating conditioned diffusions on manifolds | Marc Corstanje et.al. | 2403.05409 | link |
2024-03-08 | An implicit algorithm for simulating the dynamics of small dust grains with smoothed particle hydrodynamics | Daniel Elsender et.al. | 2403.05345 | null |
2024-03-08 | DiffSF: Diffusion Models for Scene Flow Estimation | Yushan Zhang et.al. | 2403.05327 | link |
2024-03-08 | Disorder-induced instability of a Weyl nodal loop semimetal towards a diffusive topological metal with protected multifractal surface states | João S. Silva et.al. | 2403.05298 | null |
2024-03-08 | Neutrino fluxes from different classes of galactic sources | Silvia Gagliardini et.al. | 2403.05288 | null |
2024-03-08 | Patricia's Bad Distributions | Louigi Addario-Berry et.al. | 2403.05269 | null |
2024-03-08 | Non-additivity in many-body interactions between membrane-deforming spheres increases disorder | Ali Azadbakht et.al. | 2403.05253 | null |
2024-03-08 | Noise Level Adaptive Diffusion Model for Robust Reconstruction of Accelerated MRI | Shoujin Huang et.al. | 2403.05245 | link |
2024-03-07 | Effects of mechanical stress, chemical potential, and coverage on hydrogen solubility during hydrogen enhanced decohesion of ferritic steel grain boundaries: A first-principles study | Abril Azocar Guzman et.al. | 2403.04741 | null |
2024-03-07 | Quantum-enhanced joint estimation of phase and phase diffusion | Jayanth Jayakumar et.al. | 2403.04722 | null |
2024-03-07 | ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes | Hashmat Shadab Malik et.al. | 2403.04701 | link |
2024-03-07 | Delving into the Trajectory Long-tail Distribution for Muti-object Tracking | Sijia Chen et.al. | 2403.04700 | link |
2024-03-07 | PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Junsong Chen et.al. | 2403.04692 | link |
2024-03-07 | Pix2Gif: Motion-Guided Diffusion for GIF Generation | Hitesh Kandala et.al. | 2403.04634 | link |
2024-03-07 | A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images | Cristiana Tiago et.al. | 2403.04612 | null |
2024-03-07 | Dynamic critical behavior of the chiral phase transition from the real-time functional renormalization group | Johannes V. Roth et.al. | 2403.04573 | null |
2024-03-07 | Rescaled Mode-Coupling Scheme for the Quantitative Description of Experimentally Observed Colloid Dynamics | Joel Diaz Maier et.al. | 2403.04556 | null |
2024-03-07 | Poisson equation with measure data, reconstruction formula and Doob classes of processes | Andrzej Rozkosz et.al. | 2403.04543 | null |
2024-03-06 | 3D Diffusion Policy | Yanjie Ze et.al. | 2403.03954 | link |
2024-03-06 | GUIDE: Guidance-based Incremental Learning with Diffusion Models | Bartosz Cywiński et.al. | 2403.03938 | link |
2024-03-06 | Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation | Xiao Ma et.al. | 2403.03890 | null |
2024-03-06 | Towards a Schauder theory for fractional viscous Hamilton--Jacobi equations | Espen R. Jakobsen et.al. | 2403.03884 | null |
2024-03-06 | Latent Dataset Distillation with Diffusion Models | Brian B. Moser et.al. | 2403.03881 | null |
2024-03-06 | Convergence rate of the Smoluchowski-Kramers approximation for diffusions with jumps | Chungang Shi et.al. | 2403.03877 | null |
2024-03-06 | Accelerating Convergence of Score-Based Diffusion Models, Provably | Gen Li et.al. | 2403.03852 | null |
2024-03-06 | Two 100 TeV neutrinos coincident with the Seyfert galaxy NGC 7469 | Giacomo Sommani et.al. | 2403.03752 | null |
2024-03-06 | Diffusion on language model embeddings for protein sequence generation | Viacheslav Meshchaninov et.al. | 2403.03726 | null |
2024-03-06 | Spectral Algorithms on Manifolds through Diffusion | Weichun Xia et.al. | 2403.03669 | null |
2024-03-05 | Moment estimates, exponential integrability, concentration inequalities and exit times estimates on evolving manifolds | Robert Baumgarth et.al. | 2403.03209 | null |
2024-03-05 | Scaling Rectified Flow Transformers for High-Resolution Image Synthesis | Patrick Esser et.al. | 2403.03206 | null |
2024-03-05 | MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets | Hossein Aboutalebi et.al. | 2403.03194 | link |
2024-03-05 | Behavior Generation with Latent Actions | Seungjae Lee et.al. | 2403.03181 | link |
2024-03-05 | The Amplitude Equation for the Space-Fractional Swift-Hohenberg Equation | Christian Kuehn et.al. | 2403.03158 | null |
2024-03-05 | On dynamics of gasless combustion in slowly varying periodic media: periodic fronts, their stability and propagation-extinction-diffusion-reignition pattern | Amanda Matson et.al. | 2403.03144 | null |
2024-03-05 | Enhanced beam-beam modeling to include longitudinal variation during weak-strong simulation | Derong Xu et.al. | 2403.03137 | null |
2024-03-05 | NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models | Zeqian Ju et.al. | 2403.03100 | null |
2024-03-05 | Proof-of-concept for a nonadditive stochastic model of supercooled liquids | Antonio Cesar do Prado Rosa Junior et.al. | 2403.03041 | null |
2024-03-05 | Global N-body Simulation of Gap Edge Structures Created by Perturbations from a Small Satellite Embedded in Saturn's Rings | Naoya Torii et.al. | 2403.03012 | null |
2024-03-02 | Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models | Neta Shaul et.al. | 2403.01329 | null |
2024-03-02 | Longtime behavior of semilinear multi-term fractional in time diffusion | Nataliya Vasylyeva et.al. | 2403.01302 | null |
2024-03-02 | Anomalous mass dependency in Hydra endoderm cell cluster diffusion | Aline Lütz et.al. | 2403.01294 | null |
2024-03-02 | On the Arnold diffusion mechanism in Medium Earth Orbit | Elisa Maria Alessi et.al. | 2403.01283 | null |
2024-03-02 | Rigidity results for group von Neumann algebras with diffuse center | Ionuţ Chifan et.al. | 2403.01280 | null |
2024-03-02 | Analyzing the transport coefficients and observables of a rotating QGP medium in kinetic theory framework with a novel approach to the collision integral | Shubhalaxmi Rath et.al. | 2403.01240 | null |
2024-03-02 | DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction | Junwen Xiong et.al. | 2403.01226 | null |
2024-03-02 | TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion | Salaheldin Mohamed et.al. | 2403.01212 | null |
2024-03-02 | Atacama Large Aperture Submillimeter Telescope (AtLAST) science: Gas and dust in nearby galaxies | Daizhong Liu et.al. | 2403.01202 | null |
2024-03-02 | Modelling ion acceleration and transport in corotating interaction regions: the mass-to-charge ratio dependence of the particle spectrum | Zheyi Ding et.al. | 2403.01201 | null |
2024-02-29 | DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models | Muyang Li et.al. | 2402.19481 | link |
2024-02-29 | Towards Generalizable Tumor Synthesis | Qi Chen et.al. | 2402.19470 | link |
2024-02-29 | Anomalous contribution to galactic rotation curves due to stochastic spacetime | Jonathan Oppenheim et.al. | 2402.19459 | null |
2024-02-29 | Listening to the Noise: Blind Denoising with Gibbs Diffusion | David Heurtel-Depeiges et.al. | 2402.19455 | link |
2024-02-29 | Structure Preserving Diffusion Models | Haoye Lu et.al. | 2402.19369 | null |
2024-02-29 | A new analytical model of the cosmic-ray energy flux for Galactic diffuse radio emission | Andrea Bracco et.al. | 2402.19367 | null |
2024-02-29 | A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation | Hanxi Li et.al. | 2402.19330 | link |
2024-02-29 | DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly | Gianluca Scarpellini et.al. | 2402.19302 | link |
2024-02-29 | Modeling the Progenitor Stars of Observed IIP Supernovae | Kai-An You et.al. | 2402.19260 | link |
2024-02-29 | Generative models struggle with kirigami metamaterials | Gerrit Felsch et.al. | 2402.19196 | null |
2024-02-28 | Logarithmic Sobolev Inequalities for Bounded Domains and Applications to Drift-Diffusion Equations | Elie Abdo et.al. | 2402.18572 | null |
2024-02-28 | Diffusion Language Models Are Versatile Protein Learners | Xinyou Wang et.al. | 2402.18567 | link |
2024-02-28 | Photon statistics of resonantly driven spectrally diffusive quantum emitters | Aymeric Delteil et.al. | 2402.18542 | null |
2024-02-28 | Optimality conditions for sparse optimal control of viscous Cahn-Hilliard systems with logarithmic potential | Pierluigi Colli et.al. | 2402.18506 | null |
2024-02-28 | Dynamical Regimes of Diffusion Models | Giulio Biroli et.al. | 2402.18491 | null |
2024-02-28 | Introducing cuDisc: a 2D code for protoplanetary disc structure and evolution calculations | Alfie Robinson et.al. | 2402.18471 | link |
2024-02-28 | Effect of a perpendicular magnetic field on bilayer graphene under dual gating | Mouhamadou Hassane Saley et.al. | 2402.18399 | null |
2024-02-28 | Deep Confident Steps to New Pockets: Strategies for Docking Generalization | Gabriele Corso et.al. | 2402.18396 | link |
2024-02-28 | Topological charge and spin Hall effects due to skyrmions in canted antiferromagnets | A. N. Zarezad et.al. | 2402.18369 | null |
2024-02-28 | Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model | Sangjoon Park et.al. | 2402.18362 | null |
2024-02-27 | Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning | Xiaoyu Zhang et.al. | 2402.17768 | null |
2024-02-27 | Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners | Yazhou Xing et.al. | 2402.17723 | null |
2024-02-27 | Structure-Guided Adversarial Training of Diffusion Models | Ling Yang et.al. | 2402.17563 | null |
2024-02-27 | Fast Lithium Ion Diffusion in Brownmillerite $\mathrm{Li}{x}\mathrm{{Sr}{2}{Co}{2}{O}{5}}$ | Xin Chen et.al. | 2402.17557 | null |
2024-02-27 | Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label | Xinliang Zhang et.al. | 2402.17555 | link |
2024-02-27 | Forming 1D Periodic J-aggregates by Mechanical Bending of BNNTs: Evidence of Activated Molecular Diffusion | J. -B. Marceau et.al. | 2402.17537 | null |
2024-02-27 | Diffusion Model-Based Image Editing: A Survey | Yi Huang et.al. | 2402.17525 | link |
2024-02-27 | Label-Noise Robust Diffusion Models | Byeonghu Na et.al. | 2402.17517 | link |
2024-02-27 | The Unwanted Dissemination of Science: The Usage of Academic Articles as Ammunition in Contested Discursive Arenas on Twitter | Richard Zhang et.al. | 2402.17495 | null |
2024-02-27 | EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions | Linrui Tian et.al. | 2402.17485 | null |
2024-02-26 | Stochastic Conditional Diffusion Models for Semantic Image Synthesis | Juyeon Ko et.al. | 2402.16506 | link |
2024-02-26 | Outline-Guided Object Inpainting with Diffusion Models | Markus Pobitzer et.al. | 2402.16421 | null |
2024-02-26 | Renormalisation Group Methods for Effective Epidemiological Models | Stefan Hohenegger et.al. | 2402.16409 | null |
2024-02-26 | Entropy production for diffusion processes across a semipermeable interface | Paul C Bressloff et.al. | 2402.16403 | null |
2024-02-26 | Quantitative Propagation of Chaos for Mean Field Interacting Particle System | Xing Huang et.al. | 2402.16400 | null |
2024-02-26 | Placing Objects in Context via Inpainting for Out-of-distribution Segmentation | Pau de Jorge et.al. | 2402.16392 | link |
2024-02-26 | Generative AI in Vision: A Survey on Models, Metrics and Applications | Gaurav Raut et.al. | 2402.16369 | null |
2024-02-26 | Feedback Efficient Online Fine-Tuning of Diffusion Models | Masatoshi Uehara et.al. | 2402.16359 | null |
2024-02-26 | Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion | Xuantong Liu et.al. | 2402.16305 | null |
2024-02-26 | Graph Diffusion Policy Optimization | Yijing Liu et.al. | 2402.16302 | link |
2024-02-23 | Seamless Human Motion Composition with Blended Positional Encodings | German Barquero et.al. | 2402.15509 | link |
2024-02-23 | Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition | Chun-Hsiao Yeh et.al. | 2402.15504 | link |
2024-02-23 | Length and Velocity Scales in Protoplanetary Disk Turbulence | Debanjan Sengupta et.al. | 2402.15475 | null |
2024-02-23 | Solute transport due to periodic loading in a soft porous material | Matilde Fiori et.al. | 2402.15451 | null |
2024-02-23 | ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation | Yi Zhang et.al. | 2402.15429 | link |
2024-02-23 | Dendrites with corners | Enugala Sumanth Nani et.al. | 2402.15394 | null |
2024-02-23 | Understanding Oversmoothing in Diffusion-Based GNNs From the Perspective of Operator Semigroup Theory | Weichen Zhao et.al. | 2402.15326 | null |
2024-02-23 | Ubiquitous short-range order in multi-principal element alloys | Ying Han et.al. | 2402.15305 | null |
2024-02-23 | Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models | Shunyu Liu et.al. | 2402.15289 | link |
2024-02-23 | Generative Modelling with Tensor Train approximations of Hamilton--Jacobi--Bellman equations | David Sommer et.al. | 2402.15285 | null |
2024-02-22 | Cameras as Rays: Pose Estimation via Ray Diffusion | Jason Y. Zhang et.al. | 2402.14817 | null |
2024-02-22 | GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion | Xueyi Liu et.al. | 2402.14810 | link |
2024-02-22 | Consolidating Attention Features for Multi-view Image Editing | Or Patashnik et.al. | 2402.14792 | null |
2024-02-22 | Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models | Yixuan Ren et.al. | 2402.14780 | null |
2024-02-22 | Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormality Screening | Zhenrong Shen et.al. | 2402.14707 | null |
2024-02-22 | PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model | Yukiya Hono et.al. | 2402.14692 | null |
2024-02-22 | Error Estimates for First- and Second-Order Lagrange-Galerkin Moving Mesh Schemes for the One-Dimensional Convection-Diffusion Equation | Kharisma Surya Putri et.al. | 2402.14691 | null |
2024-02-22 | Structure and thermodynamics of defects in Na-feldspar from a neural network potential | Alexander Gorfer et.al. | 2402.14640 | null |
2024-02-22 | Debiasing Text-to-Image Diffusion Models | Ruifei He et.al. | 2402.14577 | null |
2024-02-22 | DynGMA: a robust approach for learning stochastic differential equations from data | Aiqing Zhu et.al. | 2402.14475 | link |
2024-02-21 | D-Flow: Differentiating through Flows for Controlled Generation | Heli Ben-Hamu et.al. | 2402.14017 | null |
2024-02-21 | SDXL-Lightning: Progressive Adversarial Diffusion Distillation | Shanchuan Lin et.al. | 2402.13929 | null |
2024-02-21 | Non-asymptotic Convergence of Discrete-time Diffusion Models: New Approach and Improved Rate | Yuchen Liang et.al. | 2402.13901 | null |
2024-02-21 | Conformal and nonminimal couplings in fractional cosmology | Kevin Marroquín et.al. | 2402.13850 | null |
2024-02-21 | The influence of thermal pressure gradients and ionization (im)balance on the ambipolar diffusion and charge-neutral drifts | M. M. Gómez-Míguez et.al. | 2402.13813 | null |
2024-02-21 | NeuralDiffuser: Controllable fMRI Reconstruction with Primary Visual Feature Guided Diffusion | Haoyu Li et.al. | 2402.13809 | null |
2024-02-21 | The Geography of Information Diffusion in Online Discourse on Europe and Migration | Elisa Leonardelli et.al. | 2402.13800 | null |
2024-02-21 | Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions | Jiayu Chen et.al. | 2402.13777 | link |
2024-02-21 | Cas-DiffCom: Cascaded diffusion model for infant longitudinal super-resolution 3D medical image completion | Lianghu Guo et.al. | 2402.13776 | null |
2024-02-21 | Music Style Transfer with Time-Varying Inversion of Diffusion Models | Sifei Li et.al. | 2402.13763 | null |
2024-02-20 | Nonequilibrium fluctuations of chemical reaction networks at criticality: The Schlögl model as paradigmatic case | Benedikt Remlein et.al. | 2402.13168 | null |
2024-02-20 | Neural Network Diffusion | Kai Wang et.al. | 2402.13144 | link |
2024-02-20 | Ultrafast lattice disordering can be accelerated by electronic collisional forces | Gilberto A. de la Pena Munoz et.al. | 2402.13133 | null |
2024-02-20 | How accurate are simulations and experiments for the lattice energies of molecular crystals? | Flaviano Della Pia et.al. | 2402.13059 | null |
2024-02-20 | Excited state-specific CASSCF theory for the torsion of ethylene | Sandra Saade et.al. | 2402.13046 | null |
2024-02-20 | Text-Guided Molecule Generation with Diffusion Language Model | Haisong Gong et.al. | 2402.13040 | link |
2024-02-20 | The Anomalous Long-Ranged Influence of an Inclusion in Momentum-Conserving Active Fluids | Thibaut Arnoulx de Pirey et.al. | 2402.12996 | null |
2024-02-20 | Visual Style Prompting with Swapping Self-Attention | Jaeseok Jeong et.al. | 2402.12974 | link |
2024-02-20 | CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection | Sohail Ahmed Khan et.al. | 2402.12927 | link |
2024-02-20 | RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models | Xinchen Zhang et.al. | 2402.12908 | link |
2024-02-19 | FiT: Flexible Vision Transformer for Diffusion Model | Zeyu Lu et.al. | 2402.12376 | link |
2024-02-19 | A Lower Bound for Estimating Fréchet Means | Shayan Hundrieser et.al. | 2402.12290 | null |
2024-02-19 | Analysis of Persian News Agencies on Instagram, A Words Co-occurrence Graph-based Approach | Mohammad Heydari et.al. | 2402.12272 | null |
2024-02-19 | Synthetic location trajectory generation using categorical diffusion models | Simon Dirmeier et.al. | 2402.12242 | link |
2024-02-19 | Diffusion Tempering Improves Parameter Estimation with Probabilistic Integrators for Ordinary Differential Equations | Jonas Beck et.al. | 2402.12231 | link |
2024-02-19 | Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training | Leo Hyun Park et.al. | 2402.12187 | null |
2024-02-19 | Anomalous Diffusion, Prethermalization, and Particle Binding in an Interacting Flat Band System | Mirko Daumann et.al. | 2402.12180 | null |
2024-02-19 | Human Video Translation via Query Warping | Haiming Zhu et.al. | 2402.12099 | null |
2024-02-19 | Malliavin Calculus for rough stochastic differential equations | Fabio Bugini et.al. | 2402.12056 | null |
2024-02-19 | Constraining the stellar populations of ultra-diffuse galaxies in the MATLAS survey using spectral energy distribution fitting | Maria Luisa Buzzo et.al. | 2402.12033 | null |
2024-02-16 | Fusion of Diffusion Weighted MRI and Clinical Data for Predicting Functional Outcome after Acute Ischemic Stroke with Deep Contrastive Learning | Chia-Ling Tsai et.al. | 2402.10894 | null |
2024-02-16 | 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Tsung-Wei Ke et.al. | 2402.10885 | null |
2024-02-16 | Electronic Conductivity Measurements in Solid Electrolytes Using an Ion Blocking Microelectrode: Noise Rejection Based on a Median Filter | Veyis Gunes et.al. | 2402.10883 | null |
2024-02-16 | Control Color: Multimodal Diffusion-based Interactive Image Colorization | Zhexin Liang et.al. | 2402.10855 | null |
2024-02-16 | Training Class-Imbalanced Diffusion Model Via Overlap Optimization | Divin Yan et.al. | 2402.10821 | link |
2024-02-16 | VATr++: Choose Your Words Wisely for Handwritten Text Generation | Bram Vanherle et.al. | 2402.10798 | null |
2024-02-16 | Nearly-optimal effective stability estimates around Diophantine tori of Hölder Hamiltonians | Santiago Barbieri et.al. | 2402.10764 | null |
2024-02-16 | Revisiting a Core-Jet Laboratory at High Redshift: Analysis of the Radio Jet in the Quasar PKS 2215+020 at z=3.572 | Sándor Frey et.al. | 2402.10722 | null |
2024-02-16 | Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation | Hongbin Na et.al. | 2402.10699 | null |
2024-02-16 | Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm | Yuanzhen Xie et.al. | 2402.10671 | link |
2024-02-15 | Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation | Huizhuo Yuan et.al. | 2402.10210 | null |
2024-02-15 | Recovering the Pre-Fine-Tuning Weights of Generative Models | Eliahu Horwitz et.al. | 2402.10208 | link |
2024-02-15 | Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment | Rui Yang et.al. | 2402.10207 | link |
2024-02-15 | Radio-astronomical Image Reconstruction with Conditional Denoising Diffusion Model | Mariia Drozdova et.al. | 2402.10204 | link |
2024-02-15 | Tracer dynamics in polymer networks: generalized Langevin description | Sebastian Milster et.al. | 2402.10148 | null |
2024-02-15 | Energy Flux Decomposition in Magnetohydrodynamic Turbulence | D. Capocci et.al. | 2402.10125 | null |
2024-02-15 | A Blob Method for Mean Field Control With Terminal Constraints | Katy Craig et.al. | 2402.10124 | link |
2024-02-15 | Collision efficiency of droplets across diffusive, electrostatic and inertial regimes | Florian Poydenot et.al. | 2402.10117 | null |
2024-02-15 | Quantized Embedding Vectors for Controllable Diffusion Language Models | Cheng Kang et.al. | 2402.10107 | null |
2024-02-15 | Classification Diffusion Models | Shahar Yadin et.al. | 2402.10095 | null |
2024-02-14 | Magic-Me: Identity-Specific Video Customized Diffusion | Ze Ma et.al. | 2402.09368 | link |
2024-02-14 | Investigation of Ga interstitial and vacancy diffusion in |
Channyung Lee et.al. | 2402.09354 | null |
2024-02-14 | On the system size dependence of the diffusion coefficients in MD simulations: A simple correction formula for pure dense fluids | Sergey Khrapak et.al. | 2402.09348 | null |
2024-02-14 | Lattice B-field correlators for heavy quarks | Luis Altenkort et.al. | 2402.09337 | null |
2024-02-14 | Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music Audio | Pablo Alonso-Jiménez et.al. | 2402.09318 | null |
2024-02-14 | Disentangling the origin of chemical differences using GHOST | C. Saffe et.al. | 2402.09278 | null |
2024-02-14 | A Modular Deep Learning-based Approach for Diffuse Optical Tomography Reconstruction | Alessandro Benfenati et.al. | 2402.09277 | null |
2024-02-14 | Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection | Pengfei Zhou et.al. | 2402.09242 | link |
2024-02-14 | Modeling of groundwater flow in porous medium layered over inclined impermeable bed | Petr Girg et.al. | 2402.09215 | null |
2024-02-14 | A universal scaling limit for diffusive amnesic step-reinforced random walks | Marco Bertenghi et.al. | 2402.09202 | null |
2024-02-13 | IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation | Luke Melas-Kyriazi et.al. | 2402.08682 | null |
2024-02-13 | Chain Reaction of Ideas: Can Radioactive Decay Predict Technological Innovation? | Guilherme S. Y. Giardini et.al. | 2402.08681 | null |
2024-02-13 | Target Score Matching | Valentin De Bortoli et.al. | 2402.08667 | null |
2024-02-13 | Learning Continuous 3D Words for Text-to-Image Generation | Ta-Ying Cheng et.al. | 2402.08654 | link |
2024-02-13 | Clustering of primordial black holes from quantum diffusion during inflation | Chiara Animali et.al. | 2402.08642 | null |
2024-02-13 | Latent Inversion with Timestep-aware Sampling for Training-free Non-rigid Editing | Yunji Jung et.al. | 2402.08601 | null |
2024-02-13 | Denoising Diffusion Restoration Tackles Forward and Inverse Problems for the Laplace Operator | Amartya Mukherjee et.al. | 2402.08563 | null |
2024-02-13 | Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases | Ziyi Zhang et.al. | 2402.08552 | link |
2024-02-13 | Branching Interval Partition Diffusions | Matthew Buckland et.al. | 2402.08548 | null |
2024-02-13 | Hyperballistic transport in dense ionized matter under external AC electric fields | Daniele Gamba et.al. | 2402.08519 | null |
2024-02-12 | Label-Efficient Model Selection for Text Generation | Shir Ashury-Tahan et.al. | 2402.07891 | null |
2024-02-12 | High-order harmonic generation in 2D Transition Metal Disulphides | Jose Manuel Iglesias et.al. | 2402.07850 | null |
2024-02-12 | Self-heating effects and switching dynamics in graphene multiterminal Josephson junctions | Máté Kedves et.al. | 2402.07831 | null |
2024-02-12 | Towards a mathematical theory for consistency training in diffusion models | Gen Li et.al. | 2402.07802 | null |
2024-02-12 | Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models | Jiacheng Ye et.al. | 2402.07754 | link |
2024-02-12 | The GALAH survey: Elemental abundances in open clusters using joint effective temperature and surface gravity photometric priors | Kevin L. Beeson et.al. | 2402.07748 | null |
2024-02-12 | Topological Edge States in Reconfigurable Multi-stable Mechanical Metamaterials | Zhen Wang et.al. | 2402.07707 | null |
2024-02-12 | Metastability and time scales for parabolic equations with drift 2: the general time scale | Claudio Landim et.al. | 2402.07695 | null |
2024-02-12 | Cosmology at the Field Level with Probabilistic Machine Learning | Adam Rouhiainen et.al. | 2402.07694 | null |
2024-02-12 | Higher-order Connection Laplacians for Directed Simplicial Complexes | Xue Gong et.al. | 2402.07631 | null |
2024-02-09 | The impact of different unravelings in a monitored system of free fermions | Giulia Piccitto et.al. | 2402.06597 | null |
2024-02-09 | Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following | Brian Yang et.al. | 2402.06559 | link |
2024-02-09 | The role of mobility in epidemics near criticality | Beatrice Nettuno et.al. | 2402.06505 | null |
2024-02-09 | Sequential Flow Matching for Generative Modeling | Jongmin Yoon et.al. | 2402.06461 | null |
2024-02-09 | ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic Segmentation | Fengyi Shen et.al. | 2402.06446 | null |
2024-02-09 | Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation | Peter Hönig et.al. | 2402.06436 | null |
2024-02-09 | Enhanced bubble growth near an advancing solidification front | Jochem G. Meijer et.al. | 2402.06409 | null |
2024-02-09 | Spectral properties of the Dirichlet-to-Neumann operator for spheroids | Denis S. Grebenkov et.al. | 2402.06372 | null |
2024-02-09 | Sparse identification of nonlocal interaction kernels in nonlinear gradient flow equations via partial inversion | Jose A. Carrillo et.al. | 2402.06355 | null |
2024-02-09 | Particle Denoising Diffusion Sampler | Angus Phillips et.al. | 2402.06320 | link |
2024-02-08 | InstaGen: Enhancing Object Detection by Training on Synthetic Dataset | Chengjian Feng et.al. | 2402.05937 | null |
2024-02-08 | Time Series Diffusion in the Frequency Domain | Jonathan Crabbé et.al. | 2402.05933 | link |
2024-02-08 | Dirichlet Flow Matching with Applications to DNA Sequence Design | Hannes Stark et.al. | 2402.05841 | link |
2024-02-08 | AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning | Wamiq Reyaz Para et.al. | 2402.05803 | null |
2024-02-08 | Determining the significance and relative importance of parameters of a simulated quenching algorithm using statistical tools | Pedro A. Castillo et.al. | 2402.05791 | null |
2024-02-08 | Hydrogen abstraction from metal surfaces: When electron-hole pair excitations strongly affect hot-atom recombination | Oihana Galparsoro et.al. | 2402.05743 | null |
2024-02-08 | First operation of a multi-channel Q-Pix prototype: measuring transverse electron diffusion in a gas time projection chamber | Nora Hoch et.al. | 2402.05734 | null |
2024-02-08 | DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer | Zhiyuan Ma et.al. | 2402.05712 | link |
2024-02-08 | Discovery and characterisation of a new Galactic Planetary Nebula | W. E. Celnik et.al. | 2402.05658 | null |
2024-02-08 | Scalable Diffusion Models with State Space Backbone | Zhengcong Fei et.al. | 2402.05608 | link |
2024-02-07 | Nature of the diffuse emission sources in the H I supershell in the galaxy IC 1613 | Anastasiya D. Yarovova et.al. | 2402.05107 | null |
2024-02-07 | On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling | Marcin Sendera et.al. | 2402.05098 | link |
2024-02-07 | Convergence of spatial branching processes to |
Félix Foutel-Rodier et.al. | 2402.05096 | null |
2024-02-07 | Interacting particle approximation of cross-diffusion systems | Jose Antonio Carrillo et.al. | 2402.05094 | null |
2024-02-07 | NITO: Neural Implicit Fields for Resolution-free Topology Optimization | Amin Heyrani Nobari et.al. | 2402.05073 | link |
2024-02-07 | LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation | Jiaxiang Tang et.al. | 2402.05054 | null |
2024-02-07 | Non-reversible lifts of reversible diffusion processes and relaxation times | Andreas Eberle et.al. | 2402.05041 | null |
2024-02-07 | Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design | Andrew Campbell et.al. | 2402.04997 | link |
2024-02-07 | On the Cahn-Hilliard equation with kinetic rate dependent dynamic boundary conditions and non-smooth potentials: Well-posedness and asymptotic limits | Maoyin Lv et.al. | 2402.04965 | null |
2024-02-07 | Hidden non-equilibrium pathways towards crystalline perfection | A. Mangu et.al. | 2402.04962 | null |
2024-02-06 | Geometric theory of (extended) time-reversal symmetries in stochastic processes -- Part I: finite dimension | Jérémy O'Byrne et.al. | 2402.04217 | null |
2024-02-06 | Maximal regularity and optimal control for a non-local Cahn-Hilliard tumour growth model | Matteo Fornoni et.al. | 2402.04204 | null |
2024-02-06 | SHIELD : An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models | Yichen Shi et.al. | 2402.04178 | link |
2024-02-06 | Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning | Ruoqi Zhang et.al. | 2402.04080 | link |
2024-02-06 | Generative Modeling of Graphs via Joint Diffusion of Node and Edge Attributes | Nimrod Berman et.al. | 2402.04046 | null |
2024-02-06 | PAC-Bayesian Adversarially Robust Generalization Bounds for Graph Neural Network | Tan Sun et.al. | 2402.04038 | null |
2024-02-06 | Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced Segmentation | Zolnamar Dorjsembe et.al. | 2402.04031 | link |
2024-02-06 | Space Group Constrained Crystal Generation | Rui Jiao et.al. | 2402.03992 | null |
2024-02-06 | Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting | Yiming Xu et.al. | 2402.03981 | null |
2024-02-06 | Weibel- and non-resonant Whistler wave growth in an expanding plasma in a 1D simulation geometry | M E Dieckmann et.al. | 2402.03925 | null |
2024-02-05 | Do Diffusion Models Learn Semantically Meaningful and Efficient Representations? | Qiyao Liang et.al. | 2402.03305 | null |
2024-02-05 | Zero-shot Object-Level OOD Detection with Context-Aware Inpainting | Quang-Huy Nguyen et.al. | 2402.03292 | null |
2024-02-05 | InstanceDiffusion: Instance-level Control for Image Generation | Xudong Wang et.al. | 2402.03290 | link |
2024-02-05 | Estimating position-dependent and anisotropic diffusivity tensors from molecular dynamics trajectories: Existing methods and future outlook | Tiago Domingues et.al. | 2402.03285 | null |
2024-02-05 | Organic or Diffused: Can We Distinguish Human Art from AI-generated Images? | Anna Yoo Jeong Ha et.al. | 2402.03214 | null |
2024-02-05 | Light and Optimal Schrödinger Bridge Matching | Nikita Gushchin et.al. | 2402.03207 | link |
2024-02-05 | Guidance with Spherical Gaussian Constraint for Conditional Diffusion | Lingxiao Yang et.al. | 2402.03201 | link |
2024-02-05 | Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion | Shiyuan Yang et.al. | 2402.03162 | null |
2024-02-05 | Nonlinear feedback of the electrostatic instability on the blazar-induced pair beam and GeV cascade | Mahmoud Alawashra et.al. | 2402.03127 | null |
2024-02-05 | DARTS: Diffusion Approximated Residual Time Sampling for Low Variance Time-of-flight Rendering in Homogeneous Scattering Medium | Qianyue He et.al. | 2402.03106 | null |
2024-02-02 | Revealing crucial effects of reservoir environment and hydrocarbon fractions on fluid behaviour in kaolinite pores | Rixin Zhao et.al. | 2402.01633 | null |
2024-02-02 | NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties | Jingyuan Sun et.al. | 2402.01590 | null |
2024-02-02 | Transformation semigroups and their applications | Katarzyna Pichór et.al. | 2402.01572 | null |
2024-02-02 | Boximator: Generating Rich and Controllable Motions for Video Synthesis | Jiawei Wang et.al. | 2402.01566 | null |
2024-02-02 | Resolution dependence of most probable pathways with state-dependent diffusivity | Alice L. Thorneywork et.al. | 2402.01559 | null |
2024-02-02 | The galactic bubbles of starburst galaxies The influence of galactic large-scale magnetic fields | Z. Meliani et.al. | 2402.01541 | null |
2024-02-02 | Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations | Panos Kakoulidis et.al. | 2402.01520 | null |
2024-02-02 | Cross-view Masked Diffusion Transformers for Person Image Synthesis | Trung X. Pham et.al. | 2402.01516 | link |
2024-02-02 | Binomial-tree approximation for time-inconsistent stopping | Erhan Bayraktar et.al. | 2402.01482 | null |
2024-02-02 | SVI solutions to stochastic nonlinear diffusion equations on general measure spaces | Benjamin Gess et.al. | 2402.01479 | null |
2024-02-01 | AToM: Amortized Text-to-Mesh using 2D Diffusion | Guocheng Qian et.al. | 2402.00867 | null |
2024-02-01 | ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields | Jiahua Dong et.al. | 2402.00864 | link |
2024-02-01 | An Analysis of the Variance of Diffusion-based Speech Enhancement | Bunlong Lay et.al. | 2402.00811 | null |
2024-02-01 | Distilling Conditional Diffusion Models for Offline Reinforcement Learning through Trajectory Stitching | Shangzhe Li et.al. | 2402.00807 | null |
2024-02-01 | AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning | Fu-Yun Wang et.al. | 2402.00769 | link |
2024-02-01 | The Sonora Substellar Atmosphere Models. IV. Elf Owl: Atmospheric Mixing and Chemical Disequilibrium with Varying Metallicity and C/O Ratios | Sagnick Mukherjee et.al. | 2402.00756 | null |
2024-02-01 | Neutral carbon in diffuse interstellar medium: abundance matching with H2 for DLAs at high redshifts | Sergei Balashev et.al. | 2402.00714 | null |
2024-02-01 | Cylindrically symmetric diffusion model for relativistic heavy-ion collisions | Johannes Hoelck et.al. | 2402.00628 | null |
2024-02-01 | CapHuman: Capture Your Moments in Parallel Universes | Chao Liang et.al. | 2402.00627 | link |
2024-02-01 | Diffusion-based Light Field Synthesis | Ruisheng Gao et.al. | 2402.00575 | null |
2024-01-31 | Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators | Daniel Geng et.al. | 2401.18085 | null |
2024-01-31 | An electrodynamic wave model for the action potential | Vitaly L. Galinsky et.al. | 2401.18051 | null |
2024-01-31 | Reversible, Irreversible and Mixed Regimes for Periodically Driven Disks in Random Obstacle Arrays | D. Minogue et.al. | 2401.18042 | null |
2024-01-31 | Ljusternik-Schnirelmann eigenvalues for the fractional $m-$Laplacian without the |
Julian Fernandez Bonder et.al. | 2401.18041 | null |
2024-01-31 | Diagnosing the particle transport mechanism in the pulsar halo via X-ray observations | Qi-Zuo Wu et.al. | 2401.17982 | null |
2024-01-31 | Convergence Analysis for General Probability Flow ODEs of Diffusion Models in Wasserstein Distances | Xuefeng Gao et.al. | 2401.17958 | null |
2024-01-31 | Investigation of Microstructure and Corrosion Resistance of Ti-Al-V Titanium Alloys Obtained by Spark Plasma Sintering | Aleksey Nokhrin et.al. | 2401.17941 | null |
2024-01-31 | Lipolysis on Lipid Droplets: Mathematical Modelling and Numerical Discretisation | Reymart Salcedo Lagunero et.al. | 2401.17935 | link |
2024-01-31 | AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error | Jonas Ricker et.al. | 2401.17879 | link |
2024-01-31 | Multiplicity results for mass constrained Allen-Cahn equations on Riemannian manifolds with boundary | Dario Corona et.al. | 2401.17847 | null |
2024-01-30 | Study of X-ray emission from the S147 nebula with SRG/eROSITA: X-ray imaging, spectral characterization and a multiwavelength picture | Miltiadis Michailidis et.al. | 2401.17312 | null |
2024-01-30 | G321.3-3.9: a new supernova remnant observed with multi-band radio data and in the SRG/eROSITA All-Sky Surveys | S. Mantovanini et.al. | 2401.17294 | null |
2024-01-30 | Discovery of the Goat Horn complex: a |
Nicola Locatelli et.al. | 2401.17291 | null |
2024-01-30 | A new understanding of the Gemini-Monoceros X-ray enhancement from discoveries with eROSITA | Jonathan R. Knies et.al. | 2401.17289 | null |
2024-01-30 | Probing the physical properties of the IGM using SRG/eROSITA spectra from blazars | E. Gatuzz et.al. | 2401.17283 | null |
2024-01-30 | You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation | Mehdi Noroozi et.al. | 2401.17258 | null |
2024-01-30 | Stochastic motions of the two-dimensional many-body delta-Bose gas | Yu-Ting Chen et.al. | 2401.17243 | null |
2024-01-30 | ContactGen: Contact-Guided Interactive 3D Human Generation for Partners | Dongjun Gu et.al. | 2401.17212 | null |
2024-01-30 | Quantum dynamics in one and two dimensions via recursion method | Filipp Uskov et.al. | 2401.17211 | null |
2024-01-30 | Transfer Learning for Text Diffusion Models | Kehang Han et.al. | 2401.17181 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-01-03 | TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose Estimation | Jiajie Liu et.al. | 2501.01770 | null |
2024-12-30 | LS-GAN: Human Motion Synthesis with Latent-space GANs | Avinash Amballa et.al. | 2501.01449 | null |
2024-12-31 | Spatio-Temporal Multi-Subgraph GCN for 3D Human Motion Prediction | Jiexin Wang et.al. | 2501.00317 | null |
2024-12-31 | Temporal Dynamics Decoupling with Inverse Processing for Enhancing Human Motion Prediction | Jiexin Wang et.al. | 2501.00315 | null |
2024-12-30 | A Standardized Framework for Sensor Placement in Human Motion Capture and Wearable Applications | Seyed Yahya Shirazi et.al. | 2412.21159 | link |
2024-12-29 | Safe Bayesian Optimization for the Control of High-Dimensional Embodied Systems | Yunyue Wei et.al. | 2412.20350 | null |
2024-12-26 | Humans as a Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos | Changwoon Choi et.al. | 2412.19089 | null |
2024-12-25 | Simultaneously Recovering Multi-Person Meshes and Multi-View Cameras with Human Semantics | Buzhen Huang et.al. | 2412.18785 | link |
2024-12-25 | Skeleton-based Action Recognition with Non-linear Dependency Modeling and Hilbert-Schmidt Independence Criterion | Yuheng Yang et.al. | 2412.18780 | link |
2024-12-24 | ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation | Hongjie Li et.al. | 2412.18600 | null |
2024-12-23 | A Plug-and-Play Physical Motion Restoration Approach for In-the-Wild High-Difficulty Motions | Youliang Zhang et.al. | 2412.17377 | null |
2024-12-23 | Dual Conditioned Motion Diffusion for Pose-Based Video Anomaly Detection | Andi Xu et.al. | 2412.17210 | link |
2024-12-22 | InterDance:Reactive 3D Dance Generation with Realistic Duet Interactions | Ronghui Li et.al. | 2412.16982 | null |
2024-12-20 | Robustness-enhanced Myoelectric Control with GAN-based Open-set Recognition | Cheng Wang et.al. | 2412.15819 | null |
2024-12-20 | SCENIC: Scene-aware Semantic Navigation with Instruction-guided Control | Xiaohan Zhang et.al. | 2412.15664 | null |
2024-12-21 | Stitch Contrast and Segment_Learning a Human Action Segmentation Model Using Trimmed Skeleton Videos | Haitao Tian et.al. | 2412.14988 | null |
2024-12-19 | EnergyMoGen: Compositional Human Motion Generation with Energy-Based Diffusion Model in Latent Space | Jianrong Zhang et.al. | 2412.14706 | null |
2024-12-19 | DirectorLLM for Human-Centric Video Generation | Kunpeng Song et.al. | 2412.14484 | null |
2024-12-23 | THÖR-MAGNI Act: Actions for Human Motion Modeling in Robot-Shared Industrial Spaces | Tiago Rodrigues de Almeida et.al. | 2412.13729 | link |
2024-12-17 | Move-in-2D: 2D-Conditioned Human Motion Generation | Hsin-Ping Huang et.al. | 2412.13185 | null |
2024-12-17 | Motion-2-to-3: Leveraging 2D Motion Data to Boost 3D Motion Generation | Huaijin Pi et.al. | 2412.13111 | null |
2024-12-15 | Challenges and Opportunities Associated with Technology Driven Biomechanical Simulations | Zartasha Mustansar et.al. | 2412.12209 | null |
2024-12-16 | Multi-Scale Incremental Modeling for Enhanced Human Motion Prediction in Human-Robot Collaboration | Juncheng Zou et.al. | 2412.11632 | null |
2024-12-16 | Visual IRL for Human-Like Robotic Manipulation | Ehsan Asali et.al. | 2412.11360 | null |
2024-12-15 | Light-T2M: A Lightweight and Fast Model for Text-to-motion Generation | Ling-An Zeng et.al. | 2412.11193 | link |
2024-12-13 | The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion | Changan Chen et.al. | 2412.10523 | null |
2024-12-12 | Motion Generation Review: Exploring Deep Learning for Lifelike Animation with Manifold | Jiayi Zhao et.al. | 2412.10458 | null |
2024-12-13 | EnvPoser: Environment-aware Realistic Human Motion Estimation from Sparse Observations with Uncertainty Modeling | Songpengcheng Xia et.al. | 2412.10235 | null |
2024-12-05 | Mogo: RQ Hierarchical Causal Transformer for High-Quality 3D Human Motion Generation | Dongjie Fu et.al. | 2412.07797 | null |
2024-12-10 | CoMA: Compositional Human Motion Generation with Multi-modal Agents | Shanlin Sun et.al. | 2412.07320 | null |
2024-12-09 | One-shot Human Motion Transfer via Occlusion-Robust Flow Prediction and Neural Texturing | Yuzhu Ji et.al. | 2412.06174 | null |
2024-12-09 | Homogeneous Dynamics Space for Heterogeneous Humans | Xinpeng Liu et.al. | 2412.06146 | null |
2024-12-06 | CigTime: Corrective Instruction Generation Through Inverse Motion Editing | Qihang Fang et.al. | 2412.05460 | null |
2024-12-06 | Text to Blind Motion | Hee Jae Kim et.al. | 2412.05277 | null |
2024-12-06 | Assessing Similarity Measures for the Evaluation of Human-Robot Motion Correspondence | Charles Dietzel et.al. | 2412.04820 | null |
2024-12-05 | Generating Whole-Body Avoidance Motion through Localized Proximity Sensing | Simone Borelli et.al. | 2412.04649 | null |
2024-12-05 | RMD: A Simple Baseline for More General Human Motion Generation via Training-free Retrieval-Augmented Motion Diffuse | Zhouyingcheng Liao et.al. | 2412.04343 | null |
2024-12-03 | Diffusion Implicit Policy for Unpaired Scene-aware Motion Synthesis | Jingyu Gong et.al. | 2412.02261 | null |
2024-12-02 | Continuous-Time Human Motion Field from Events | Ziyun Wang et.al. | 2412.01747 | null |
2024-12-02 | Dual-Branch Graph Transformer Network for 3D Human Mesh Reconstruction from Video | Tao Tang et.al. | 2412.01179 | link |
2024-11-30 | Human Action CLIPS: Detecting AI-generated Human Motion | Matyas Bohacek et.al. | 2412.00526 | null |
2024-12-03 | OpenHumanVid: A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation | Hui Li et.al. | 2412.00115 | null |
2024-11-28 | BiPO: Bidirectional Partial Occlusion Network for Text-to-Motion Synthesis | Seong-Eun Hong et.al. | 2412.00112 | null |
2024-11-29 | MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks | Yiming Wu et.al. | 2411.19786 | null |
2024-12-02 | DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding | Jungbin Cho et.al. | 2411.19527 | null |
2024-11-29 | Fleximo: Towards Flexible Text-to-Human Motion Video Generation | Yuhang Zhang et.al. | 2411.19459 | null |
2024-11-27 | DiffMVR: Diffusion-based Automated Multi-Guidance Video Restoration | Zheyan Zhang et.al. | 2411.18745 | null |
2024-11-27 | AToM: Aligning Text-to-Motion Model at Event-Level with GPT-4Vision Reward | Haonan Han et.al. | 2411.18654 | null |
2024-11-27 | InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation | Wenjie Zhuo et.al. | 2411.18303 | null |
2024-11-30 | MotionCharacter: Identity-Preserving and Motion Controllable Human Video Generation | Haopeng Fang et.al. | 2411.18281 | null |
2024-11-26 | FTMoMamba: Motion Generation with Frequency and Text State Space Models | Chengjian Li et.al. | 2411.17532 | null |
2024-11-27 | MotionWavelet: Human Motion Prediction via Wavelet Manifold Learning | Yuming Feng et.al. | 2411.16964 | null |
2024-11-25 | Statistical Emulation of Human Operational Motions | Yanliang Chen et.al. | 2411.16929 | null |
2024-11-27 | Human Motion Instruction Tuning | Lei Li et.al. | 2411.16805 | null |
2024-11-24 | Bundle Adjusted Gaussian Avatars Deblurring | Muyao Niu et.al. | 2411.16758 | null |
2024-11-25 | Rethinking Diffusion for Text-Driven Human Motion Generation | Zichong Meng et.al. | 2411.16575 | null |
2024-11-25 | Multi-Resolution Generative Modeling of Human Motion from Limited Data | David Eduardo Moreno-Villamarín et.al. | 2411.16498 | null |
2024-11-25 | Performance Benchmarking of Psychomotor Skills Using Wearable Devices: An Application in Sport | Mahela Pandukabhaya et.al. | 2411.16168 | null |
2024-11-23 | KinMo: Kinematic-aware Human Motion Understanding and Generation | Pengfei Zhang et.al. | 2411.15472 | null |
2024-11-22 | PRIMUS: Pretraining IMU Encoders with Multimodal Self-Supervision | Arnav M. Das et.al. | 2411.15127 | null |
2024-11-22 | Morph: A Motion-free Physics Optimization Framework for Human Motion Generation | Zhuo Li et.al. | 2411.14951 | null |
2024-11-19 | VioPose: Violin Performance 4D Pose Estimation by Hierarchical Audiovisual Inference | Seong Jong Yoo et.al. | 2411.13607 | link |
2024-11-20 | Fine-tuning Myoelectric Control through Reinforcement Learning in a Game Environment | Kilian Freitag et.al. | 2411.13327 | null |
2024-11-19 | Towards motion from video diffusion models | Paul Janson et.al. | 2411.12831 | null |
2024-11-19 | Breathless: An 8-hour Performance Contrasting Human and Robot Expressiveness | Catie Cuan et.al. | 2411.12361 | null |
2024-11-15 | Motion Diffusion-Guided 3D Global HMR from a Dynamic Camera | Jaewoo Heo et.al. | 2411.10582 | null |
2024-11-13 | DiVR: incorporating context from diverse VR scenes for human trajectory prediction | Franz Franco Gallo et.al. | 2411.08409 | null |
2024-11-10 | KMM: Key Frame Mask Mamba for Extended Motion Generation | Zeyu Zhang et.al. | 2411.06481 | link |
2024-11-10 | Learning Uniformly Distributed Embedding Clusters of Stylistic Skills for Physically Simulated Characters | Nian Liu et.al. | 2411.06459 | null |
2024-11-08 | Poze: Sports Technique Feedback under Data Constraints | Agamdeep Singh et.al. | 2411.05734 | null |
2024-11-07 | ProGraph: Temporally-alignable Probability Guided Graph Topological Modeling for 3D Human Reconstruction | Hongsheng Wang et.al. | 2411.04399 | null |
2024-11-06 | UnityGraph: Unified Learning of Spatio-temporal features for Multi-person Motion Prediction | Kehua Qu et.al. | 2411.04151 | null |
2024-11-06 | Object-Centric Dexterous Manipulation from Human Motion Data | Yuanpei Chen et.al. | 2411.04005 | null |
2024-11-04 | Multi-Transmotion: Pre-trained Model for Human Motion Prediction | Yang Gao et.al. | 2411.02673 | link |
2024-11-07 | Differentially Private Integrated Decision Gradients (IDG-DP) for Radar-based Human Activity Recognition | Idris Zakariyya et.al. | 2411.02099 | link |
2024-11-04 | MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence | Fuming You et.al. | 2411.01805 | null |
2024-11-07 | Online Relational Inference for Evolving Multi-agent Interacting Systems | Beomseok Kang et.al. | 2411.01442 | link |
2024-10-31 | Muscles in Time: Learning to Understand Human Motion by Simulating Muscle Activations | David Schneider et.al. | 2411.00128 | link |
2024-10-31 | TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation | Sunjae Yoon et.al. | 2410.24037 | null |
2024-10-29 | MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding | Yuan Wang et.al. | 2410.21747 | null |
2024-11-01 | RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior | Mingjiang Liang et.al. | 2410.20358 | null |
2024-10-24 | MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms | Ling-Hao Chen et.al. | 2410.18977 | null |
2024-10-23 | Efficient Neural Implicit Representation for 3D Human Reconstruction | Zexu Huang et.al. | 2410.17741 | link |
2024-10-23 | ImDy: Human Inverse Dynamics from Imitated Observations | Xinpeng Liu et.al. | 2410.17610 | null |
2024-10-22 | MotionGlot: A Multi-Embodied Motion Generation Model | Sudarshan Harithas et.al. | 2410.16623 | null |
2024-10-21 | ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recovery from Videos | Tao Tang et.al. | 2410.15582 | link |
2024-10-18 | LEAD: Latent Realignment for Human Motion Diffusion | Nefeli Andreou et.al. | 2410.14508 | null |
2024-10-17 | MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations | Liang Xu et.al. | 2410.13790 | link |
2024-10-16 | Harmon: Whole-Body Motion Generation of Humanoid Robots from Language Descriptions | Zhenyu Jiang et.al. | 2410.12773 | null |
2024-10-16 | Fast Online Learning of CLiFF-maps in Changing Environments | Yufei Zhu et.al. | 2410.12237 | null |
2024-10-15 | Learned Neural Physics Simulation for Articulated 3D Human Pose Reconstruction | Mykhaylo Andriluka et.al. | 2410.12023 | null |
2024-10-15 | OKAMI: Teaching Humanoid Robots Manipulation Skills through Single Video Imitation | Jinhan Li et.al. | 2410.11792 | null |
2024-10-15 | MoChat: Joints-Grouped Spatio-Temporal Grounding LLM for Multi-Turn Motion Comprehension and Description | Jiawei Mo et.al. | 2410.11404 | null |
2024-10-14 | Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes | Jianqi Chen et.al. | 2410.10790 | link |
2024-10-14 | DR-MPC: Deep Residual Model Predictive Control for Real-world Social Navigation | James R. Han et.al. | 2410.10646 | null |
2024-10-10 | Online DNN-driven Nonlinear MPC for Stylistic Humanoid Robot Walking with Step Adjustment | Giulio Romualdi et.al. | 2410.07849 | null |
2024-10-10 | Optimal-State Dynamics Estimation for Physics-based Human Motion Capture from Videos | Cuong Le et.al. | 2410.07795 | link |
2024-10-10 | Generalization Ability Analysis of Through-the-Wall Radar Human Activity Recognition | Weicheng Gao et.al. | 2410.07543 | null |
2024-10-15 | ReinDiffuse: Crafting Physically Plausible Motions with Reinforced Diffusion Model | Gaoge Han et.al. | 2410.07296 | null |
2024-10-09 | LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning | Zhe Li et.al. | 2410.07093 | null |
2024-10-09 | LocoVR: Multiuser Indoor Locomotion Dataset in Virtual Reality | Kojiro Takeyama et.al. | 2410.06437 | link |
2024-10-08 | Construction of Musculoskeletal Simulation for Shoulder Complex with Ligaments and Its Validation via Model Predictive Control | Yuta Sahara et.al. | 2410.05931 | null |
2024-10-07 | DART: A Diffusion-Based Autoregressive Motion Model for Real-Time Text-Driven Motion Control | Kaifeng Zhao et.al. | 2410.05260 | null |
2024-10-07 | Anticipating Human Behavior for Safe Navigation and Efficient Collaborative Manipulation with Mobile Service Robots | Simon Bultmann et.al. | 2410.05015 | null |
2024-10-04 | MDMP: Multi-modal Diffusion for supervised Motion Predictions with uncertainty | Leo Bringer et.al. | 2410.03860 | link |
2024-10-04 | Estimating Body and Hand Motion in an Ego-sensed World | Brent Yi et.al. | 2410.03665 | null |
2024-10-04 | CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control | Guy Tevet et.al. | 2410.03441 | link |
2024-10-04 | Quo Vadis, Motion Generation? From Large Language Models to Large Motion Models | Ye Wang et.al. | 2410.03311 | null |
2024-10-08 | Autonomous Character-Scene Interaction Synthesis from Text Instruction | Nan Jiang et.al. | 2410.03187 | null |
2024-10-02 | Bi-Level Motion Imitation for Humanoid Robots | Wenshuai Zhao et.al. | 2410.01968 | null |
2024-09-30 | Replace Anyone in Videos | Xiang Wang et.al. | 2409.19911 | null |
2024-09-29 | Text-driven Human Motion Generation with Motion Masked Diffusion Model | Xingyu Chen et.al. | 2409.19686 | null |
2024-09-29 | BadHMP: Backdoor Attack against Human Motion Prediction | Chaohui Xu et.al. | 2409.19638 | null |
2024-09-26 | EgoLM: Multi-Modal Language Model of Egocentric Motions | Fangzhou Hong et.al. | 2409.18127 | null |
2024-09-25 | TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans | Aggelina Chatziagapi et.al. | 2409.16666 | null |
2024-09-30 | Unimotion: Unifying 3D Human Motion Synthesis and Understanding | Chuqiao Li et.al. | 2409.15904 | null |
2024-09-27 | CauSkelNet: Causal Representation Learning for Human Behaviour Analysis | Xingrui Gu et.al. | 2409.15564 | null |
2024-09-23 | Built Different: Tactile Perception to Overcome Cross-Embodiment Capability Differences in Collaborative Manipulation | William van den Bogert et.al. | 2409.14896 | null |
2024-09-21 | ExFMan: Rendering 3D Dynamic Humans with Hybrid Monocular Blurry Frames and Events | Kanghao Chen et.al. | 2409.14103 | null |
2024-09-21 | PoseAugment: Generative Human Pose Data Augmentation with Physical Plausibility for IMU-based Motion Capture | Zhuojun Li et.al. | 2409.14101 | link |
2024-09-20 | HMD |
Vladimir Guzov et.al. | 2409.13426 | null |
2024-09-19 | Bayesian-Optimized One-Step Diffusion Model with Knowledge Distillation for Real-Time 3D Human Motion Prediction | Sibo Tian et.al. | 2409.12456 | null |
2024-09-18 | Massively Multi-Person 3D Human Motion Forecasting with Scene Context | Felix B Mueller et.al. | 2409.12189 | link |
2024-09-18 | MoRAG -- Multi-Fusion Retrieval Augmented Generation for Human Motion | Kalakonda Sai Shashank et.al. | 2409.12140 | null |
2024-09-18 | Generation of Complex 3D Human Motion by Temporal and Spatial Composition of Diffusion Models | Lorenzo Mandelli et.al. | 2409.11920 | null |
2024-09-18 | A novel pedestrian road crossing simulator for dynamic traffic light scheduling systems | Dayuan Tan et.al. | 2409.11623 | null |
2024-09-16 | Know your limits! Optimize the robot's behavior through self-awareness | Esteve Valls Mascaro et.al. | 2409.10308 | null |
2024-09-13 | Transformer with Controlled Attention for Synchronous Motion Captioning | Karim Radouane et.al. | 2409.09177 | link |
2024-09-12 | Hand-Object Interaction Pretraining from Videos | Himanshu Gaurav Singh et.al. | 2409.08273 | null |
2024-09-12 | GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions | Liang Feng et.al. | 2409.07798 | null |
2024-09-12 | GatedUniPose: A Novel Approach for Pose Estimation Combining UniRepLKNet and Gated Convolution | Liang Feng et.al. | 2409.07752 | null |
2024-09-10 | Human Motion Synthesis_ A Diffusion Approach for Motion Stitching and In-Betweening | Michael Adewole et.al. | 2409.06791 | null |
2024-09-10 | World-Grounded Human Motion Recovery via Gravity-View Coordinates | Zehong Shen et.al. | 2409.06662 | null |
2024-09-14 | HiSC4D: Human-centered interaction and 4D Scene Capture in Large-scale Space Using Wearable IMUs and LiDAR | Yudi Dai et.al. | 2409.04398 | null |
2024-09-05 | HUMOS: Human Motion Model Conditioned on Body Shape | Shashank Tripathi et.al. | 2409.03944 | link |
2024-09-04 | MADiff: Motion-Aware Mamba Diffusion Models for Hand Trajectory Prediction on Egocentric Videos | Junyi Ma et.al. | 2409.02638 | null |
2024-09-05 | Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency | Jianwen Jiang et.al. | 2409.02634 | null |
2024-09-02 | AMG: Avatar Motion Guided Video Generation | Zhangsihao Yang et.al. | 2409.01502 | link |
2024-09-01 | SITUATE: Indoor Human Trajectory Prediction through Geometric Features and Self-Supervised Vision Representation | Luigi Capogrosso et.al. | 2409.00774 | link |
2024-09-01 | MoManifold: Learning to Measure 3D Human Motion via Decoupled Joint Acceleration Manifolds | Ziqiang Dang et.al. | 2409.00736 | null |
2024-09-05 | EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System | Bonan Liu et.al. | 2409.00343 | null |
2024-08-30 | EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs | Zhen Fan et.al. | 2408.17168 | null |
2024-08-30 | Temporal and Interactive Modeling for Efficient Human-Human Motion Generation | Yabiao Wang et.al. | 2408.17135 | null |
2024-08-29 | Motion-Driven Neural Optimizer for Prophylactic Braces Made by Distributed Microstructures | Xingjian Han et.al. | 2408.16659 | null |
2024-08-29 | COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation | Jiefeng Li et.al. | 2408.16426 | null |
2024-08-27 | PoseWatch: A Transformer-based Architecture for Human-centric Video Anomaly Detection Using Spatio-temporal Pose Tokenization | Ghazal Alinezhad Noghre et.al. | 2408.15185 | null |
2024-08-23 | T3M: Text Guided 3D Human Motion Synthesis from Speech | Wenshuo Peng et.al. | 2408.12885 | link |
2024-08-22 | Through-the-Wall Radar Human Activity Micro-Doppler Signature Representation Method Based on Joint Boulic-Sinusoidal Pendulum Model | Xiaopeng Yang et.al. | 2408.12077 | null |
2024-08-21 | SynPlay: Importing Real-world Diversity for a Synthetic Human Dataset | Jinsub Yim et.al. | 2408.11814 | null |
2024-08-19 | Modelling the Distribution of Human Motion for Sign Language Assessment | Oliver Cory et.al. | 2408.10073 | null |
2024-08-18 | OPPH: A Vision-Based Operator for Measuring Body Movements for Personal Healthcare | Chen Long-fei et.al. | 2408.09409 | null |
2024-08-18 | Combo: Co-speech holistic 3D human motion generation and efficient customizable adaptation in harmony | Chao Xu et.al. | 2408.09397 | null |
2024-08-15 | Towards Practical Human Motion Prediction with LiDAR Point Clouds | Xiao Han et.al. | 2408.08202 | null |
2024-07-30 | Learning Multi-Modal Whole-Body Control for Real-World Humanoid Robots | Pranay Dugar et.al. | 2408.07295 | null |
2024-08-13 | ViMo: Generating Motions from Casual Videos | Liangdong Qiu et.al. | 2408.06614 | null |
2024-08-12 | Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization | Geuntaek Lim et.al. | 2408.05955 | link |
2024-08-05 | Analyzing Data Efficiency and Performance of Machine Learning Algorithms for Assessing Low Back Pain Physical Rehabilitation Exercises | Aleksa Marusic et.al. | 2408.02855 | null |
2024-08-20 | AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos | Feichi Lu et.al. | 2408.02110 | null |
2024-08-04 | Past Movements-Guided Motion Representation Learning for Human Motion Prediction | Junyu Shi et.al. | 2408.02091 | null |
2024-08-01 | MotionFix: Text-Driven 3D Human Motion Editing | Nikos Athanasiou et.al. | 2408.00712 | null |
2024-08-01 | Autonomous LLM-Enhanced Adversarial Attack for Text-to-Motion | Honglei Miao et.al. | 2408.00352 | null |
2024-08-04 | Adding Multimodal Controls to Whole-body Human Motion Generation | Yuxuan Bian et.al. | 2407.21136 | link |
2024-07-28 | Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph | Zhengcen Li et.al. | 2407.19497 | link |
2024-07-27 | Radio Frequency Signal based Human Silhouette Segmentation: A Sequential Diffusion Approach | Penghui Wen et.al. | 2407.19244 | link |
2024-07-28 | HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation | Zhenzhi Wang et.al. | 2407.17438 | link |
2024-07-23 | Fusion and Cross-Modal Transfer for Zero-Shot Human Action Recognition | Abhi Kamboj et.al. | 2407.16803 | null |
2024-07-23 | Occlusion-Aware 3D Motion Interpretation for Abnormal Behavior Detection | Su Li et.al. | 2407.16788 | null |
2024-07-23 | Real-Time Interactions Between Human Controllers and Remote Devices in Metaverse | Kan Chen et.al. | 2407.16591 | null |
2024-07-23 | Motion Capture from Inertial and Vision Sensors | Xiaodong Chen et.al. | 2407.16341 | null |
2024-07-22 | Chronologically Accurate Retrieval for Temporal Grounding of Motion-Language Models | Kent Fujiwara et.al. | 2407.15408 | null |
2024-07-19 | M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models | Seunggeun Chi et.al. | 2407.14502 | null |
2024-07-19 | Stochastic Model Predictive Control with Optimal Linear Feedback for Mobile Robots in Dynamic Environments | Yunfan Gao et.al. | 2407.14220 | null |
2024-07-16 | Imitation of human motion achieves natural head movements for humanoid robots in an active-speaker detection task | Bosong Ding et.al. | 2407.11915 | link |
2024-07-16 | Length-Aware Motion Synthesis via Latent Diffusion | Alessio Sampieri et.al. | 2407.11532 | link |
2024-07-16 | Learning Semantic Latent Directions for Accurate and Controllable Human Motion Prediction | Guowei Xu et.al. | 2407.11494 | link |
2024-07-15 | WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models | Zijian He et.al. | 2407.10625 | null |
2024-07-15 | Learning Social Cost Functions for Human-Aware Path Planning | Andrea Eirale et.al. | 2407.10547 | link |
2024-07-15 | Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation | Peng Jin et.al. | 2407.10528 | null |
2024-07-15 | SuperPADL: Scaling Language-Directed Physics-Based Control with Progressive Supervised Distillation | Jordan Juravsky et.al. | 2407.10481 | null |
2024-07-14 | InfiniMotion: Mamba Boosts Memory in Transformer for Arbitrary Long Motion Generation | Zeyu Zhang et.al. | 2407.10061 | link |
2024-07-13 | LiveHPS++: Robust and Coherent Motion Capture in Dynamic Free Environment | Yiming Ren et.al. | 2407.09833 | null |
2024-07-11 | A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights | Wentao Lei et.al. | 2407.08428 | link |
2024-07-08 | CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation | Xinying Guo et.al. | 2407.06188 | null |
2024-07-04 | The path towards contact-based physical human-robot interaction | Mohammad Farajtabar et.al. | 2407.02664 | null |
2024-07-02 | HOIMotion: Forecasting Human Motion During Human-Object Interactions Using Egocentric 3D Object Bounding Boxes | Zhiming Hu et.al. | 2407.02633 | null |
2024-07-02 | Aligning Human Motion Generation with Human Perceptions | Haoru Wang et.al. | 2407.02272 | link |
2024-07-02 | Joint-Dataset Learning and Cross-Consistent Regularization for Text-to-Motion Retrieval | Nicola Messina et.al. | 2407.02104 | null |
2024-07-01 | Task-oriented Over-the-air Computation for Edge-device Co-inference with Balanced Classification Accuracy | Xiang Jiao et.al. | 2407.00955 | null |
2024-06-30 | OfCaM: Global Human Mesh Recovery via Optimization-free Camera Motion Scale Calibration | Fengyuan Yang et.al. | 2407.00574 | null |
2024-06-28 | MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance | Yuang Zhang et.al. | 2406.19680 | null |
2024-06-27 | CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement | Chengwen Zhang et.al. | 2406.19353 | link |
2024-06-26 | Human-Aware 3D Scene Generation with Spatially-constrained Diffusion Models | Xiaolin Hong et.al. | 2406.18159 | null |
2024-06-26 | Speech2UnifiedExpressions: Synchronous Synthesis of Co-Speech Affective Face and Body Expressions from Affordable Inputs | Uttaran Bhattacharya et.al. | 2406.18068 | null |
2024-06-25 | Human-Object Interaction from Human-Level Instructions | Zhen Wu et.al. | 2406.17840 | null |
2024-06-24 | Feature Fusion for Human Activity Recognition using Parameter-Optimized Multi-Stage Graph Convolutional Network and Transformer Models | Mohammad Belal et.al. | 2406.16638 | null |
2024-06-24 | Do As I Do: Pose Guided Human Motion Copy | Sifan Wu et.al. | 2406.16601 | null |
2024-06-20 | Inverse optimal control problem in the non autonomous linear-quadratic case | Frédéric Jean et.al. | 2406.14270 | null |
2024-06-17 | Holistic-Motion2D: Scalable Whole-body Human Motion Generation in 2D Space | Yuan Wang et.al. | 2406.11253 | null |
2024-06-21 | FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models | Zhikai Zhang et.al. | 2406.10740 | null |
2024-06-15 | HumanPlus: Humanoid Shadowing and Imitation from Humans | Zipeng Fu et.al. | 2406.10454 | null |
2024-06-14 | Nymeria: A Massive Collection of Multimodal Egocentric Daily Motion in the Wild | Lingni Ma et.al. | 2406.09905 | null |
2024-06-13 | OmniH2O: Universal and Dexterous Human-to-Humanoid Whole-Body Teleoperation and Learning | Tairan He et.al. | 2406.08858 | null |
2024-06-11 | RecMoDiffuse: Recurrent Flow Diffusion for Human Motion Generation | Mirgahney Mohamed et.al. | 2406.07169 | null |
2024-06-10 | Monkey See, Monkey Do: Harnessing Self-attention in Motion Diffusion for Zero-shot Motion Transfer | Sigal Raab et.al. | 2406.06508 | link |
2024-06-10 | Human Gaze and Head Rotation during Navigation, Exploration and Object Manipulation in Shared Environments with Robots | Tim Schreiter et.al. | 2406.06300 | null |
2024-06-07 | SMART: Scene-motion-aware human action recognition framework for mental disorder group | Zengyuan Lai et.al. | 2406.04649 | link |
2024-06-03 | PDP: Physics-Based Character Animation via Diffusion Policy | Takara E. Truong et.al. | 2406.00960 | null |
2024-06-02 | Unsupervised Neural Motion Retargeting for Humanoid Teleoperation | Satoshi Yagi et.al. | 2406.00727 | null |
2024-06-02 | T2LM: Long-Term 3D Human Motion Generation from Multiple Sentences | Taeryung Lee et.al. | 2406.00636 | null |
2024-05-30 | MotionLLM: Understanding Human Behaviors from Human Motions and Videos | Ling-Hao Chen et.al. | 2405.20340 | link |
2024-05-30 | RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text | Jiaben Chen et.al. | 2405.20336 | null |
2024-05-30 | SMPLX-Lite: A Realistic and Drivable Avatar Benchmark with Rich Geometry and Texture Annotations | Yujiao Jiang et.al. | 2405.19609 | null |
2024-05-30 | Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction | Xuehao Gao et.al. | 2405.18700 | null |
2024-05-30 | Benchmarking Skeleton-based Motion Encoder Models for Clinical Applications: Estimating Parkinson's Disease Severity in Walking Sequences | Vida Adeli et.al. | 2405.17817 | link |
2024-05-28 | MotionLLM: Multimodal Motion-Language Learning with Large Language Models | Qi Wu et.al. | 2405.17013 | null |
2024-05-27 | A Cross-Dataset Study for Text-based 3D Human Motion Retrieval | Léore Bensabath et.al. | 2405.16909 | null |
2024-05-25 | SuDA: Support-based Domain Adaptation for Sim2Real Motion Capture with Flexible Sensors | Jiawei Fang et.al. | 2405.16152 | null |
2024-05-24 | FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis | Ke Fan et.al. | 2405.15763 | null |
2024-05-24 | Learning Generalizable Human Motion Generator with Reinforcement Learning | Yunyao Mao et.al. | 2405.15541 | null |
2024-05-24 | Text-guided 3D Human Motion Generation with Keyframe-based Parallel Skip Transformer | Zichen Geng et.al. | 2405.15439 | null |
2024-05-24 | A Systematic Review on Custom Data Gloves | Valerio Belcamino et.al. | 2405.15417 | null |
2024-05-24 | On the Identification of Temporally Causal Representation with Instantaneous Dependence | Zijian Li et.al. | 2405.15325 | null |
2024-05-24 | Off-the-shelf ChatGPT is a Good Few-shot Human Motion Predictor | Haoxuan Qu et.al. | 2405.15267 | null |
2024-05-23 | Event-based dataset for the detection and classification of manufacturing assembly tasks | Laura Duarte et.al. | 2405.14626 | link |
2024-05-21 | MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video | Hongsheng Wang et.al. | 2405.12806 | null |
2024-05-21 | Towards Using Fast Embedded Model Predictive Control for Human-Aware Predictive Robot Navigation | Till Hielscher et.al. | 2405.12616 | null |
2024-05-21 | Physics-based Scene Layout Generation from Human Motion | Jianan Li et.al. | 2405.12460 | null |
2024-05-23 | Flexible Motion In-betweening with Diffusion Models | Setareh Cohan et.al. | 2405.11126 | null |
2024-05-17 | Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis | Zeyi Zhang et.al. | 2405.09814 | null |
2024-05-16 | Integrating Uncertainty-Aware Human Motion Prediction into Graph-Based Manipulator Motion Planning | Wansong Liu et.al. | 2405.09779 | null |
2024-05-24 | ContourCraft: Learning to Resolve Intersections in Neural Multi-Garment Simulations | Artur Grigorev et.al. | 2405.09522 | null |
2024-05-13 | Generating Human Motion in 3D Scenes from Text Descriptions | Zhi Cen et.al. | 2405.07784 | null |
2024-05-13 | Establishing a Unified Evaluation Framework for Human Motion Generation: A Comparative Analysis of Metrics | Ali Ismail-Fawaz et.al. | 2405.07680 | link |
2024-05-13 | Motion Keyframe Interpolation for Any Human Skeleton via Temporally Consistent Point Cloud Sampling and Reconstruction | Clinton Mo et.al. | 2405.07444 | null |
2024-05-10 | Shape Conditioned Human Motion Generation with Diffusion Model | Kebing Xue et.al. | 2405.06778 | null |
2024-05-09 | A Mixture of Experts Approach to 3D Human Motion Prediction | Edmund Shieh et.al. | 2405.06088 | link |
2024-05-09 | StableMoFusion: Towards Robust and Efficient Diffusion-based Motion Generation Framework | Yiheng Huang et.al. | 2405.05691 | null |
2024-05-08 | Audio Matters Too! Enhancing Markerless Motion Capture with Audio Signals for String Performance Capture | Yitong Jin et.al. | 2405.04963 | link |
2024-05-08 | WixUp: A General Data Augmentation Framework for Wireless Perception in Tracking of Humans | Yin Li et.al. | 2405.04804 | null |
2024-05-08 | Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches | Qing Yu et.al. | 2405.04771 | null |
2024-05-07 | Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos | Junyi Ma et.al. | 2405.04370 | link |
2024-05-06 | MoDiPO: text-to-motion alignment via AI-feedback-driven Direct Preference Optimization | Massimiliano Pappa et.al. | 2405.03803 | null |
2024-05-06 | LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model | Haowen Sun et.al. | 2405.03485 | link |
2024-05-05 | Multimodal Sense-Informed Prediction of 3D Human Motions | Zhenyu Lou et.al. | 2405.02911 | null |
2024-05-05 | Efficient Text-driven Motion Generation via Latent Consistency Training | Mengxian Hu et.al. | 2405.02791 | link |
2024-05-03 | Physics-informed generative neural networks for RF propagation prediction with application to indoor body perception | Federica Fieramosca et.al. | 2405.02131 | null |
2024-04-30 | MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model | Wenxun Dai et.al. | 2404.19759 | link |
2024-04-30 | PACER+: On-Demand Pedestrian Animation Controller in Driving Scenarios | Jingbo Wang et.al. | 2404.19722 | null |
2024-04-30 | Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis | Shivam Mehta et.al. | 2404.19622 | null |
2024-04-30 | Physical Non-inertial Poser (PNP): Modeling Non-inertial Effects in Sparse-inertial Human Motion Capture | Xinyu Yi et.al. | 2404.19619 | null |
2024-04-30 | Ultra Inertial Poser: Scalable Motion Capture and Tracking from Sparse Inertial Sensors and Ultra-Wideband Ranging | Rayan Armani et.al. | 2404.19541 | link |
2024-04-29 | 4D-DRESS: A 4D Dataset of Real-world Human Clothing with Semantic Annotations | Wenbo Wang et.al. | 2404.18630 | link |
2024-04-27 | Hybrid 3D Human Pose Estimation with Monocular Video and Sparse IMUs | Yiming Bao et.al. | 2404.17837 | null |
2024-04-26 | Clustering of Motion Trajectories by a Distance Measure Based on Semantic Features | Christoph Zelch et.al. | 2404.17269 | link |
2024-04-25 | SHINE: Social Homology Identification for Navigation in Crowded Environments | Diego Martinez-Baselga et.al. | 2404.16705 | null |
2024-04-23 | WANDR: Intention-guided Human Motion Generation | Markos Diomataris et.al. | 2404.15383 | null |
2024-04-20 | Efficient Verification of a RADAR SoC Using Formal and Simulation-Based Methods | Aman Kumar et.al. | 2404.15371 | null |
2024-04-19 | A Weight-aware-based Multi-source Unsupervised Domain Adaptation Method for Human Motion Intention Recognition | Xiao-Yin Liu et.al. | 2404.15366 | link |
2024-04-23 | TAAT: Think and Act from Arbitrary Texts in Text2Motion | Runqi Wang et.al. | 2404.14745 | null |
2024-04-21 | MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Human Motions | Sheng Yan et.al. | 2404.13657 | link |
2024-04-19 | Purposer: Putting Human Motion Generation in Context | Nicolas Ugrinovic et.al. | 2404.12942 | null |
2024-04-19 | MCM: Multi-condition Motion Synthesis Framework | Zeyu Ling et.al. | 2404.12886 | null |
2024-04-17 | Text-controlled Motion Mamba: Text-Instructed Temporal Grounding of Human Motion | Xinghan Wang et.al. | 2404.11375 | null |
2024-04-17 | Following the Human Thread in Social Navigation | Luca Scofano et.al. | 2404.11327 | link |
2024-04-16 | HumMUSS: Human Motion Understanding using State Space Models | Arnab Kumar Mondal et.al. | 2404.10880 | null |
2024-04-15 | in2IN: Leveraging individual Information to Generate Human INteractions | Pablo Ruiz Ponce et.al. | 2404.09988 | link |
2024-04-15 | Learning Human Motion from Monocular Videos via Cross-Modal Manifold Alignment | Shuaiying Hou et.al. | 2404.09499 | null |
2024-04-12 | Synthesis of Through-Wall Micro-Doppler Signatures of Human Motions Using Generative Adversarial Networks | Kainat Yasmeen Shobha Sundar Ram et.al. | 2404.08739 | null |
2024-04-12 | EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams | Christen Millerdurai et.al. | 2404.08640 | link |
2024-04-11 | Model Predictive Trajectory Planning for Human-Robot Handovers | Thies Oelerich et.al. | 2404.07505 | null |
2024-04-08 | Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning | Mahsa Ehsanpour et.al. | 2404.05578 | null |
2024-04-08 | Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning | Jaewoo Jeong et.al. | 2404.05218 | link |
2024-04-07 | A Unified Diffusion Framework for Scene-aware Human Motion Estimation from Sparse Signals | Jiangnan Tang et.al. | 2404.04890 | link |
2024-04-05 | PhysPT: Physics-aware Pretrained Transformer for Estimating Human Dynamics from Monocular Videos | Yufei Zhang et.al. | 2404.04430 | null |
2024-04-04 | Towards more realistic human motion prediction with attention to motion coordination | Pengxiang Ding et.al. | 2404.03584 | null |
2024-04-03 | MotionChain: Conversational Motion Controllers via Multimodal Prompts | Biao Jiang et.al. | 2404.01700 | link |
2024-04-02 | Leveraging Digital Perceptual Technologies for Remote Perception and Analysis of Human Biomechanical Processes: A Contactless Approach for Workload and Joint Force Assessment | Jesudara Omidokun et.al. | 2404.01576 | null |
2024-04-01 | Large Motion Model for Unified Multi-Modal Motion Generation | Mingyuan Zhang et.al. | 2404.01284 | null |
2024-04-02 | SurMo: Surface-based 4D Motion Modeling for Dynamic Human Rendering | Tao Hu et.al. | 2404.01225 | null |
2024-03-29 | A Unified Framework for Human-centric Point Cloud Video Understanding | Yiteng Xu et.al. | 2403.20031 | null |
2024-03-28 | InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction | Sirui Xu et.al. | 2403.19652 | null |
2024-03-28 | RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method | Ming Yan et.al. | 2403.19501 | null |
2024-03-28 | Beyond Talking -- Generating Holistic 3D Human Dyadic Motion for Communication | Mingze Sun et.al. | 2403.19467 | null |
2024-04-01 | BAMM: Bidirectional Autoregressive Motion Model | Ekkasit Pinyoanuntapong et.al. | 2403.19435 | link |
2024-03-30 | Egocentric Scene-aware Human Trajectory Prediction | Weizhuo Wang et.al. | 2403.19026 | null |
2024-03-26 | Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance | Zan Wang et.al. | 2403.18036 | link |
2024-03-26 | ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis | Muhammad Hamza Mughal et.al. | 2403.17936 | null |
2024-03-30 | MMVP: A Multimodal MoCap Dataset with Vision and Pressure Sensors | He Zhang et.al. | 2403.17610 | null |
2024-03-28 | Gaze-guided Hand-Object Interaction Synthesis: Benchmark and Method | Jie Tian et.al. | 2403.16169 | null |
2024-03-26 | PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling | Xiaoyun Zheng et.al. | 2403.16080 | link |
2024-03-23 | Human Motion Prediction under Unexpected Perturbation | Jiangbei Yue et.al. | 2403.15891 | null |
2024-03-23 | Contact-aware Human Motion Generation from Textual Descriptions | Sihan Ma et.al. | 2403.15709 | null |
2024-03-22 | GPT-Connect: Interaction between Text-Driven Human Motion Generator and 3D Scenes in a Training-free Manner | Haoxuan Qu et.al. | 2403.14947 | null |
2024-03-21 | HCTO: Optimality-Aware LiDAR Inertial Odometry with Hybrid Continuous Time Optimization for Compact Wearable Mapping System | Jianping Li et.al. | 2403.14173 | link |
2024-03-21 | Existence Is Chaos: Enhancing 3D Human Motion Prediction with Uncertainty Consideration | Zhihao Wang et.al. | 2403.14104 | null |
2024-03-20 | CoMo: Controllable Motion Generation through Language Guided Pose Code Editing | Yiming Huang et.al. | 2403.13900 | null |
2024-03-20 | LaCE-LHMP: Airflow Modelling-Inspired Long-Term Human Motion Prediction By Enhancing Laminar Characteristics in Human Flow | Yufei Zhu et.al. | 2403.13640 | link |
2024-03-21 | LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment | Peishan Cong et.al. | 2403.13307 | link |
2024-03-20 | Map-Aware Human Pose Prediction for Robot Follow-Ahead | Qingyuan Jiang et.al. | 2403.13294 | null |
2024-03-19 | WHAC: World-grounded Humans and Cameras | Wanqi Yin et.al. | 2403.12959 | link |
2024-03-18 | Graph-Jigsaw Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection | Ali Karami et.al. | 2403.12172 | null |
2024-03-18 | UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling | Yujiao Jiang et.al. | 2403.11589 | null |
2024-03-17 | FORCE: Dataset and Method for Intuitive Physics Guided Human-object Interaction | Xiaohan Zhang et.al. | 2403.11237 | null |
2024-03-17 | THOR: Text to Human-Object Interaction Diffusion via Relation Intervention | Qianyang Wu et.al. | 2403.11208 | null |
2024-03-14 | GazeMotion: Gaze-guided Human Motion Forecasting | Zhiming Hu et.al. | 2403.09885 | null |
2024-03-14 | THÖR-MAGNI: A Large-scale Indoor Motion Capture Recording of Human Movement and Robot Interaction | Tim Schreiter et.al. | 2403.09285 | link |
2024-03-13 | Scaling Up Dynamic Human-Scene Interaction Modeling | Nan Jiang et.al. | 2403.08629 | null |
2024-03-12 | DexCap: Scalable and Portable Mocap Data Collection System for Dexterous Manipulation | Chen Wang et.al. | 2403.07788 | null |
2024-03-19 | Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM | Zeyu Zhang et.al. | 2403.07487 | link |
2024-03-10 | Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation | Paweł A. Pierzchlewicz et.al. | 2403.06164 | link |
2024-03-09 | MATRIX: Multi-Agent Trajectory Generation with Diverse Contexts | Zhuo Xu et.al. | 2403.06041 | null |
2024-03-09 | Enhancing Expressiveness in Dance Generation via Integrating Frequency and Music Style Information | Qiaochu Huang et.al. | 2403.05834 | link |
2024-03-08 | Integrating Predictive Motion Uncertainties with Distributionally Robust Risk-Aware Control for Safe Robot Navigation in Crowds | Kanghyun Ryu et.al. | 2403.05081 | link |
2024-03-11 | Fooling Neural Networks for Motion Forecasting via Adversarial Attacks | Edgar Medina et.al. | 2403.04954 | null |
2024-03-06 | HMD-Poser: On-Device Real-time Human Motion Tracking from Scalable Sparse Observations | Peng Dai et.al. | 2403.03561 | null |
2024-03-01 | Tri-Modal Motion Retrieval by Learning a Joint Embedding Space | Kangning Yin et.al. | 2403.00691 | null |
2024-02-21 | Context-based Interpretable Spatio-Temporal Graph Convolutional Network for Human Motion Forecasting | Edgar Medina et.al. | 2402.19237 | link |
2024-02-29 | MOSAIC: A Modular System for Assistive and Interactive Cooking | Huaxiaoyue Wang et.al. | 2402.18796 | null |
2024-02-27 | SocialCVAE: Predicting Pedestrian Trajectory via Interaction Conditioned Latents | Wei Xiang et.al. | 2402.17339 | link |
2024-02-27 | LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free Environment | Yiming Ren et.al. | 2402.17171 | null |
2024-03-06 | Expressive Whole-Body Control for Humanoid Robots | Xuxin Cheng et.al. | 2402.16796 | null |
2024-02-23 | Seamless Human Motion Composition with Blended Positional Encodings | German Barquero et.al. | 2402.15509 | link |
2024-03-05 | 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data | Zhi-Yi Lin et.al. | 2402.13172 | null |
2024-02-20 | A Recurrent Neural Network Enhanced Unscented Kalman Filter for Human Motion Prediction | Wansong Liu et.al. | 2402.13045 | null |
2024-02-19 | Human Video Translation via Query Warping | Haiming Zhu et.al. | 2402.12099 | null |
2024-02-04 | Custom IMU-Based Wearable System for Robust 2.4 GHz Wireless Human Body Parts Orientation Tracking and 3D Movement Visualization on an Avatar | Javier González-Alonso et.al. | 2402.09459 | null |
2024-01-30 | Progress in artificial intelligence applications based on the combination of self-driven sensors and deep learning | Weixiang Wan et.al. | 2402.09442 | null |
2024-02-13 | Approximately Piecewise E(3) Equivariant Point Networks | Matan Atzmon et.al. | 2402.08529 | null |
2024-02-11 | Self-Correcting Self-Consuming Loops for Generative Model Training | Nate Gillman et.al. | 2402.07087 | link |
2024-02-06 | Bidirectional Autoregressive Diffusion Model for Dance Generation | Canyu Zhang et.al. | 2402.04356 | link |
2024-02-06 | Novel IMU-based Adaptive Estimator of the Center of Rotation of Joints for Movement Analysis | Sara García-de-Villa et.al. | 2402.04240 | null |
2024-02-05 | Replication of Impedance Identification Experiments on a Reinforcement-Learning-Controlled Digital Twin of Human Elbows | Hao Yu et.al. | 2402.02904 | null |
2024-02-01 | Transferring human emotions to robot motions using Neural Policy Style Transfer | Raul Fernandez-Fernandez et.al. | 2402.00663 | null |
2024-01-25 | Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks | Tianhe Ren et.al. | 2401.14159 | link |
2024-01-24 | Generative Human Motion Stylization in Latent Space | Chuan Guo et.al. | 2401.13505 | null |
2024-01-24 | GTAutoAct: An Automatic Datasets Generation Framework Based on Game Engine Redevelopment for Action Recognition | Xingyu Song et.al. | 2401.13414 | null |
2024-01-23 | Workspace Optimization Techniques to Improve Prediction of Human Motion During Human-Robot Collaboration | Yi-Shiuan Tung et.al. | 2401.12965 | null |
2024-01-23 | Inertial Sensors for Human Motion Analysis: A Comprehensive Review | Sara García-de-Villa et.al. | 2401.12919 | null |
2024-01-23 | A database of physical therapy exercises with variability of execution collected by wearable sensors | Sara García-de-Villa et.al. | 2401.12868 | null |
2024-01-22 | Full-Body Motion Reconstruction with Sparse Sensing from Graph Perspective | Feiyu Yao et.al. | 2401.11783 | link |
2024-01-24 | MotionMix: Weakly-Supervised Diffusion for Controllable Motion Generation | Nhat M. Hoang et.al. | 2401.11115 | link |
2024-01-19 | Equivariant Graph Neural Operator for Modeling 3D Dynamics | Minkai Xu et.al. | 2401.11037 | link |
2024-01-16 | RoHM: Robust Human Motion Reconstruction via Diffusion | Siwei Zhang et.al. | 2401.08570 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-01-03 | Controlling your Attributes in Voice | Xuyuan Li et.al. | 2501.01674 | null |
2025-01-02 | Object-level Visual Prompts for Compositional Image Generation | Gaurav Parmar et.al. | 2501.01424 | null |
2025-01-02 | On Unifying Video Generation and Camera Pose Estimation | Chun-Hao Paul Huang et.al. | 2501.01409 | null |
2025-01-02 | ProjectedEx: Enhancing Generation in Explainable AI for Prostate Cancer | Xuyin Qi et.al. | 2501.01392 | link |
2025-01-02 | Test-time Controllable Image Generation by Explicit Spatial Constraint Enforcement | Z. Zhang et.al. | 2501.01368 | null |
2025-01-02 | LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge | Kyoungkook Kang et.al. | 2501.01197 | null |
2025-01-02 | HarmonyIQA: Pioneering Benchmark and Model for Image Harmonization Quality Assessment | Zitong Xu et.al. | 2501.01116 | null |
2025-01-02 | EliGen: Entity-Level Controlled Image Generation with Regional Attention | Hong Zhang et.al. | 2501.01097 | link |
2025-01-01 | OASIS Uncovers: High-Quality T2I Models, Same Old Stereotypes | Sepehr Dehdashtian et.al. | 2501.00962 | null |
2025-01-01 | Enhancing Early Diabetic Retinopathy Detection through Synthetic DR1 Image Generation: A StyleGAN3 Approach | Sagarnil Das et.al. | 2501.00954 | null |
2025-01-01 | Diffusion Prism: Enhancing Diversity and Morphology Consistency in Mask-to-Image Diffusion | Hao Wang et.al. | 2501.00944 | null |
2025-01-02 | Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation | Yuanbo Yang et.al. | 2412.21117 | null |
2024-12-30 | Quantum Diffusion Model for Quark and Gluon Jet Generation | Mariia Baidachna et.al. | 2412.21082 | link |
2025-01-02 | Edicho: Consistent Image Editing in the Wild | Qingyan Bai et.al. | 2412.21079 | link |
2024-12-30 | Varformer: Adapting VAR's Generative Prior for Image Restoration | Siyang Wang et.al. | 2412.21063 | link |
2024-12-30 | VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control | Shaojin Wu et.al. | 2412.20800 | link |
2024-12-30 | HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images | Sungik Choi et.al. | 2412.20704 | null |
2024-12-30 | Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis | Yousef Yeganeh et.al. | 2412.20651 | null |
2024-12-29 | Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond) | Tomer Garber et.al. | 2412.20596 | null |
2024-12-29 | Diff4MMLiTS: Advanced Multimodal Liver Tumor Segmentation via Diffusion-Based Image Synthesis and Alignment | Shiyun Chen et.al. | 2412.20418 | null |
2024-12-29 | Open-Sora: Democratizing Efficient Video Production for All | Zangwei Zheng et.al. | 2412.20404 | link |
2024-12-27 | P3S-Diffusion:A Selective Subject-driven Generation Framework via Point Supervision | Junjie Hu et.al. | 2412.19533 | null |
2024-12-27 | Focusing Image Generation to Mitigate Spurious Correlations | Xuewei Li et.al. | 2412.19457 | null |
2024-12-25 | UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation | Lunhao Duan et.al. | 2412.18928 | null |
2024-12-25 | DiFiC: Your Diffusion Model Holds the Secret to Fine-Grained Clustering | Ruohong Yang et.al. | 2412.18838 | null |
2024-12-25 | DebiasDiff: Debiasing Text-to-image Diffusion Models with Self-discovering Latent Attribute Directions | Yilei Jiang et.al. | 2412.18810 | null |
2024-12-25 | DRDM: A Disentangled Representations Diffusion Model for Synthesizing Realistic Person Images | Enbo Huang et.al. | 2412.18797 | null |
2024-12-25 | Protective Perturbations against Unauthorized Data Usage in Diffusion-based Image Generation | Sen Peng et.al. | 2412.18791 | null |
2024-12-25 | Elucidating Flow Matching ODE Dynamics with respect to Data Geometries | Gal Mishne et.al. | 2412.18730 | null |
2024-12-24 | 1.58-bit FLUX | Chenglin Yang et.al. | 2412.18653 | null |
2024-12-24 | Dissecting CLIP: Decomposition with a Schur Complement-based Approach | Azim Ospanov et.al. | 2412.18645 | link |
2024-12-24 | Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models | Qice Qin et.al. | 2412.18421 | null |
2024-12-24 | Extract Free Dense Misalignment from CLIP | JeongYeon Nam et.al. | 2412.18404 | link |
2024-12-24 | RDPM: Solve Diffusion Probabilistic Models via Recurrent Token Prediction | Wu Xiaoping et.al. | 2412.18390 | null |
2024-12-24 | TextMatch: Enhancing Image-Text Consistency Through Multimodal Optimization | Yucong Luo et.al. | 2412.18185 | null |
2024-12-24 | EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Evaluation | Shuhao Han et.al. | 2412.18150 | link |
2024-12-24 | Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction | Xiao Guo et.al. | 2412.18149 | null |
2024-12-24 | Ensuring Consistency for In-Image Translation | Chengpeng Fu et.al. | 2412.18139 | null |
2024-12-23 | The Superposition of Diffusion Models Using the Itô Density Estimator | Marta Skreta et.al. | 2412.17762 | null |
2024-12-23 | Personalized Large Vision-Language Models | Chau Pham et.al. | 2412.17610 | null |
2024-12-23 | Discriminative Image Generation with Diffusion Models for Zero-Shot Learning | Dingjie Fu et.al. | 2412.17219 | null |
2024-12-22 | Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching | Enshu Liu et.al. | 2412.17153 | link |
2024-12-22 | Similarity Trajectories: Linking Sampling Process to Artifacts in Diffusion-Generated Images | Dennis Menn et.al. | 2412.17109 | null |
2024-12-22 | DreamOmni: Unified Image Generation and Editing | Bin Xia et.al. | 2412.17098 | null |
2024-12-22 | Modular Conversational Agents for Surveys and Interviews | Jiangbo Yu et.al. | 2412.17049 | null |
2024-12-22 | HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories | Eric Hedlin et.al. | 2412.17040 | null |
2024-12-22 | Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Text-to-Image Generation | Quan Dao et.al. | 2412.16906 | null |
2024-12-22 | Diffusion-Based Approaches in Medical Image Generation and Analysis | Abdullah al Nomaan Nafi et.al. | 2412.16860 | null |
2024-12-20 | Personalized Representation from Personalized Generation | Shobhita Sundaram et.al. | 2412.16156 | link |
2024-12-20 | NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems | Laura Weihl et.al. | 2412.16141 | null |
2024-12-20 | CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up | Songhua Liu et.al. | 2412.16112 | link |
2024-12-20 | SafeCFG: Redirecting Harmful Classifier-Free Guidance for Safe Generation | Jiadong Pan et.al. | 2412.16039 | null |
2024-12-20 | Semi-Supervised Adaptation of Diffusion Models for Handwritten Text Generation | Kai Brandenbusch et.al. | 2412.15853 | null |
2024-12-20 | Diffusion-Based Conditional Image Editing through Optimized Inference with Guidance | Hyunsoo Lee et.al. | 2412.15798 | null |
2024-12-20 | PersonaMagic: Stage-Regulated High-Fidelity Face Customization with Tandem Equilibrium | Xinzhe Li et.al. | 2412.15674 | null |
2024-12-20 | BS-LDM: Effective Bone Suppression in High-Resolution Chest X-Ray Images with Conditional Latent Diffusion Models | Yifei Sun et.al. | 2412.15670 | link |
2024-12-20 | SemDP: Semantic-level Differential Privacy Protection for Face Datasets | Xiaoting Zhang et.al. | 2412.15590 | null |
2024-12-20 | Stylish and Functional: Guided Interpolation Subject to Physical Constraints | Yan-Ying Chen et.al. | 2412.15507 | null |
2024-12-19 | UIP2P: Unsupervised Instruction-based Image Editing via Cycle Edit Consistency | Enis Simsar et.al. | 2412.15216 | null |
2024-12-19 | Flowing from Words to Pixels: A Framework for Cross-Modality Evolution | Qihao Liu et.al. | 2412.15213 | null |
2024-12-19 | FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching | Sucheng Ren et.al. | 2412.15205 | link |
2024-12-19 | LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation | Weijia Shi et.al. | 2412.15188 | null |
2024-12-19 | Tiled Diffusion | Or Madar et.al. | 2412.15185 | null |
2024-12-19 | DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space | Mang Ning et.al. | 2412.15032 | link |
2024-12-19 | Qua |
Keith G. Mills et.al. | 2412.14628 | null |
2024-12-19 | DiffusionTrend: A Minimalist Approach to Virtual Fashion Try-On | Wengyi Zhan et.al. | 2412.14465 | null |
2024-12-19 | Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion | Jixuan He et.al. | 2412.14462 | link |
2024-12-19 | LEDiff: Latent Exposure Diffusion for HDR Generation | Chao Wang et.al. | 2412.14456 | null |
2024-12-18 | E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling | Zhihang Yuan et.al. | 2412.14170 | null |
2024-12-18 | Autoregressive Video Generation without Vector Quantization | Haoge Deng et.al. | 2412.14169 | link |
2024-12-18 | FashionComposer: Compositional Fashion Image Generation | Sihui Ji et.al. | 2412.14168 | null |
2024-12-18 | VideoDPO: Omni-Preference Alignment for Video Diffusion Generation | Runtao Liu et.al. | 2412.14167 | null |
2024-12-18 | Text2Relight: Creative Portrait Relighting with Text Guidance | Junuk Cha et.al. | 2412.13734 | null |
2024-12-18 | Diffusion models and stochastic quantisation in lattice field theory | Gert Aarts et.al. | 2412.13704 | null |
2024-12-18 | MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing | Chuang Yang et.al. | 2412.13684 | null |
2024-12-18 | Self-control: A Better Conditional Mechanism for Masked Autoregressive Model | Qiaoying Qu et.al. | 2412.13635 | null |
2024-12-17 | Posterior Mean Matching: Generative Modeling through Online Bayesian Inference | Sebastian Salazar et.al. | 2412.13286 | null |
2024-12-17 | F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration | Lu Liu et.al. | 2412.13155 | null |
2024-12-17 | Prompt Augmentation for Self-supervised Text-guided Image Manipulation | Rumeysa Bodur et.al. | 2412.13081 | null |
2024-12-17 | 3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation | Haoshen Wang et.al. | 2412.13059 | null |
2024-12-17 | Stable Diffusion is a Natural Cross-Modal Decoder for Layered AI-generated Image Compression | Ruijie Chen et.al. | 2412.12982 | null |
2024-12-17 | Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance | Wenhao Sun et.al. | 2412.12974 | link |
2024-12-17 | Unsupervised Region-Based Image Editing of Denoising Diffusion Models | Zixiang Li et.al. | 2412.12912 | null |
2024-12-17 | ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction | Zhongjie Duan et.al. | 2412.12888 | link |
2024-12-17 | Rethinking Diffusion-Based Image Generators for Fundus Fluorescein Angiography Synthesis on Limited Data | Chengzhou Yu et.al. | 2412.12778 | null |
2024-12-17 | Guided and Variance-Corrected Fusion with One-shot Style Alignment for Large-Content Image Generation | Shoukun Sun et.al. | 2412.12771 | link |
2024-12-17 | Consistent Diffusion: Denoising Diffusion Model with Data-Consistent Training for Image Restoration | Xinlong Cheng et.al. | 2412.12550 | null |
2024-12-16 | Causal Diffusion Transformers for Generative Modeling | Chaorui Deng et.al. | 2412.12095 | link |
2024-12-16 | A LoRA is Worth a Thousand Pictures | Chenxi Liu et.al. | 2412.12048 | null |
2024-12-16 | IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation | Yiren Song et.al. | 2412.11638 | null |
2024-12-16 | 3D |
Zichen Tang et.al. | 2412.11599 | link |
2024-12-16 | VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis | Zhipeng Chen et.al. | 2412.11594 | link |
2024-12-16 | LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model | Xi Wang et.al. | 2412.11519 | null |
2024-12-16 | FedCAR: Cross-client Adaptive Re-weighting for Generative Models in Federated Learning | Minjun Kim et.al. | 2412.11463 | link |
2024-12-16 | Nearly Zero-Cost Protection Against Mimicry by Personalized Diffusion Models | Namhyuk Ahn et.al. | 2412.11423 | null |
2024-12-16 | Relation-Guided Adversarial Learning for Data-free Knowledge Transfer | Yingping Liang et.al. | 2412.11380 | link |
2024-12-15 | Sonicmesh: Enhancing 3D Human Mesh Reconstruction in Vision-Impaired Environments With Acoustic Signals | Xiaoxuan Liang et.al. | 2412.11325 | null |
2024-12-13 | OP-LoRA: The Blessing of Dimensionality | Piotr Teterwak et.al. | 2412.10362 | null |
2024-12-13 | BrushEdit: All-In-One Image Inpainting and Editing | Yaowei Li et.al. | 2412.10316 | null |
2024-12-13 | Learning Complex Non-Rigid Image Edits from Multimodal Conditioning | Nikolai Warner et.al. | 2412.10219 | null |
2024-12-13 | Simple Guidance Mechanisms for Discrete Diffusion Models | Yair Schiff et.al. | 2412.10193 | link |
2024-12-13 | Financial Fine-tuning a Large Time Series Model | Xinghong Fu et.al. | 2412.09880 | link |
2024-12-12 | Human vs. AI: A Novel Benchmark and a Comparative Study on the Detection of Generated Images and the Impact of Prompts | Philipp Moeßner et.al. | 2412.09715 | link |
2024-12-12 | Diffusion-Enhanced Test-time Adaptation with Text and Image Augmentation | Chun-Mei Feng et.al. | 2412.09706 | link |
2024-12-12 | LoRACLR: Contrastive Adaptation for Customization of Diffusion Models | Enis Simsar et.al. | 2412.09622 | null |
2024-12-12 | EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | Zhuofan Zong et.al. | 2412.09618 | null |
2024-12-12 | Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG | Kavana Venkatesh et.al. | 2412.09614 | null |
2024-12-12 | FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers | Yusuf Dalva et.al. | 2412.09611 | null |
2024-12-12 | Spectral Image Tokenizer | Carlos Esteves et.al. | 2412.09607 | null |
2024-12-12 | Are Conditional Latent Diffusion Models Effective for Image Restoration? | Yunchen Yuan et.al. | 2412.09324 | null |
2024-12-12 | RAD: Region-Aware Diffusion Models for Image Inpainting | Sora Kim et.al. | 2412.09191 | null |
2024-12-12 | LVMark: Robust Watermark for latent video diffusion models | MinHyuk Jang et.al. | 2412.09122 | null |
2024-12-12 | ViUniT: Visual Unit Tests for More Robust Visual Programming | Artemis Panagopoulou et.al. | 2412.08859 | null |
2024-12-11 | Silvan Fischbacher et.al. | 2412.08716 | null | |
2024-12-11 | Fast Prompt Alignment for Text-to-Image Generation | Khalil Mrini et.al. | 2412.08639 | link |
2024-12-11 | Multimodal Latent Language Modeling with Next-Token Diffusion | Yutao Sun et.al. | 2412.08635 | link |
2024-12-11 | LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations | Zejian Li et.al. | 2412.08580 | link |
2024-12-11 | Learning Flow Fields in Attention for Controllable Person Image Generation | Zijian Zhou et.al. | 2412.08486 | link |
2024-12-11 | InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models | Min Hou et.al. | 2412.08480 | link |
2024-12-11 | CC-Diff: Enhancing Contextual Coherence in Remote Sensing Image Synthesis | Mu Zhang et.al. | 2412.08464 | null |
2024-12-11 | Analyzing and Improving Model Collapse in Rectified Flow Models | Huminhao Zhu et.al. | 2412.08175 | null |
2024-12-11 | AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting | Zihao Han et.al. | 2412.08149 | null |
2024-12-11 | Seeing Syntax: Uncovering Syntactic Learning Limitations in Vision-Language Models | Sri Harsha Dumpala et.al. | 2412.08111 | null |
2024-12-11 | Generative Zoo | Tomasz Niewiadomski et.al. | 2412.08101 | null |
2024-12-10 | UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics | Xi Chen et.al. | 2412.07774 | null |
2024-12-10 | FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models | Tong Wu et.al. | 2412.07674 | null |
2024-12-10 | DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation | Jianzong Wu et.al. | 2412.07589 | null |
2024-12-10 | StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization | Jinlu Zhang et.al. | 2412.07375 | link |
2024-12-10 | Fusion Embedding for Pose-Guided Person Image Synthesis with Diffusion Model | Donghwna Lee et.al. | 2412.07333 | null |
2024-12-10 | A Generative Victim Model for Segmentation | Aixuan Li et.al. | 2412.07274 | null |
2024-12-10 | Buster: Incorporating Backdoor Attacks into Text Encoder to Mitigate NSFW Content Generation | Xin Zhao et.al. | 2412.07249 | null |
2024-12-10 | Moderating the Generalization of Score-based Generative Model | Wan Jiang et.al. | 2412.07229 | null |
2024-12-10 | Fine-grained Text to Image Synthesis | Xu Ouyang et.al. | 2412.07196 | null |
2024-12-10 | FIRE: Robust Detection of Diffusion-Generated Images via Frequency-Guided Reconstruction Error | Beilin Chu et.al. | 2412.07140 | null |
2024-12-09 | Visual Lexicon: Rich Image Features in Language Space | XuDong Wang et.al. | 2412.06774 | null |
2024-12-09 | Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty | Meera Hahn et.al. | 2412.06771 | link |
2024-12-09 | ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet | Andrei-Robert Alexandrescu et.al. | 2412.06742 | null |
2024-12-09 | EMOv2: Pushing 5M Vision Model Frontier | Jiangning Zhang et.al. | 2412.06674 | link |
2024-12-09 | ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance | Chunwei Wang et.al. | 2412.06673 | null |
2024-12-09 | Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion | Shuaiting Li et.al. | 2412.06661 | null |
2024-12-09 | PrEditor3D: Fast and Precise 3D Shape Editing | Ziya Erkoç et.al. | 2412.06592 | null |
2024-12-09 | MoViE: Mobile Diffusion for Video Editing | Adil Karjauv et.al. | 2412.06578 | null |
2024-12-09 | Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment | Kim Sung-Bin et.al. | 2412.06209 | null |
2024-12-09 | ASGDiffusion: Parallel High-Resolution Generation with Asynchronous Structure Guidance | Yuming Li et.al. | 2412.06163 | null |
2024-12-06 | LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation | Donald Shenaj et.al. | 2412.05148 | null |
2024-12-06 | The Silent Prompt: Initial Noise as Implicit Guidance for Goal-Driven Image Generation | Ruoyu Wang et.al. | 2412.05101 | null |
2024-12-06 | Noise Matters: Diffusion Model-based Urban Mobility Generation with Collaborative Noise Priors | Yuheng Zhang et.al. | 2412.05000 | null |
2024-12-06 | Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction | Gaurav Shrivastava et.al. | 2412.04929 | null |
2024-12-06 | Addressing Attribute Leakages in Diffusion-based Image Editing without Training | Sunung Mun et.al. | 2412.04715 | null |
2024-12-05 | Hidden in the Noise: Two-Stage Robust Watermarking for Images | Kasra Arabi et.al. | 2412.04653 | null |
2024-12-05 | One Communication Round is All It Needs for Federated Fine-Tuning Foundation Models | Ziyao Wang et.al. | 2412.04650 | null |
2024-12-05 | Action-based image editing guided by human instructions | Maria Mihaela Trusca et.al. | 2412.04558 | null |
2024-12-05 | LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors | Yusuf Dalva et.al. | 2412.04460 | null |
2024-12-05 | Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis | Jian Han et.al. | 2412.04431 | link |
2024-12-05 | Multi-Subject Image Synthesis as a Generative Prior for Single-Subject PET Image Reconstruction | George Webber et.al. | 2412.04324 | null |
2024-12-05 | The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation | Fredrik Carlsson et.al. | 2412.04318 | null |
2024-12-05 | SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion | Trong-Tung Nguyen et.al. | 2412.04301 | null |
2024-12-05 | T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts | Ziwei Huang et.al. | 2412.04300 | null |
2024-12-05 | Structure-Aware Stylized Image Synthesis for Robust Medical Image Segmentation | Jie Bao et.al. | 2412.04296 | link |
2024-12-05 | HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing | Jinbin Bai et.al. | 2412.04280 | link |
2024-12-05 | AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models | Xinghui Li et.al. | 2412.04146 | null |
2024-12-05 | BodyMetric: Evaluating the Realism of HumanBodies in Text-to-Image Generation | Nefeli Andreou et.al. | 2412.04086 | null |
2024-12-04 | MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation | Zehuan Huang et.al. | 2412.03558 | null |
2024-12-04 | Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective | Neta Shaul et.al. | 2412.03487 | null |
2024-12-04 | Skel3D: Skeleton Guided Novel View Synthesis | Aron Fóthi et.al. | 2412.03407 | null |
2024-12-04 | Implicit Priors Editing in Stable Diffusion via Targeted Token Adjustment | Feng He et.al. | 2412.03400 | null |
2024-12-04 | DIVE: Taming DINO for Subject-Driven Video Editing | Yi Huang et.al. | 2412.03347 | null |
2024-12-04 | Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis | Tao Jun Lin et.al. | 2412.03315 | null |
2024-12-04 | Is JPEG AI going to change image forensics? | Edoardo Daniele Cannas et.al. | 2412.03261 | null |
2024-12-04 | DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation | Qingdong He et.al. | 2412.03255 | null |
2024-12-04 | Towards Understanding and Quantifying Uncertainty for Text-to-Image Generation | Gianni Franchi et.al. | 2412.03178 | null |
2024-12-04 | PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation | Qihan Huang et.al. | 2412.03177 | link |
2024-12-03 | Motion Prompting: Controlling Video Generation with Motion Trajectories | Daniel Geng et.al. | 2412.02700 | null |
2024-12-03 | Taming Scalable Visual Tokenizer for Autoregressive Image Generation | Fengyuan Shi et.al. | 2412.02692 | link |
2024-12-03 | FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation | Kefan Chen et.al. | 2412.02690 | null |
2024-12-03 | SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance | Viet Nguyen et.al. | 2412.02687 | null |
2024-12-03 | MetaShadow: Object-Centered Shadow Detection, Removal, and Synthesis | Tianyu Wang et.al. | 2412.02635 | null |
2024-12-03 | WEM-GAN: Wavelet transform based facial expression manipulation | Dongya Sun et.al. | 2412.02530 | null |
2024-12-03 | ScImage: How Good Are Multimodal Large Language Models at Scientific Text-to-Image Generation? | Leixin Zhang et.al. | 2412.02368 | link |
2024-12-03 | GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing | Khawar Islam et.al. | 2412.02366 | null |
2024-12-03 | Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models | Jungwon Park et.al. | 2412.02237 | link |
2024-12-03 | GIST: Towards Photorealistic Style Transfer via Multiscale Geometric Representations | Renan A. Rojas-Gomez et.al. | 2412.02214 | null |
2024-11-29 | JetFormer: An Autoregressive Generative Model of Raw Images and Text | Michael Tschannen et.al. | 2411.19722 | null |
2024-11-29 | TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting | Bojun Xiong et.al. | 2411.19654 | null |
2024-11-29 | Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing | Wenyi Mo et.al. | 2411.19652 | link |
2024-11-29 | QUOTA: Quantifying Objects with Text-to-Image Models for Any Domain | Wenfang Sun et.al. | 2411.19534 | null |
2024-11-29 | Retrieval-guided Cross-view Image Synthesis | Hongji Yang et.al. | 2411.19510 | null |
2024-11-29 | Achromatic single-layer hologram | Zhi Li et.al. | 2411.19445 | null |
2024-11-28 | AMO Sampler: Enhancing Text Rendering with Overshooting | Xixi Hu et.al. | 2411.19415 | null |
2024-11-28 | DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models | Shwetha Ram et.al. | 2411.19390 | null |
2024-11-28 | Improving Multi-Subject Consistency in Open-Domain Image Generation with Isolation and Reposition Attention | Huiguo He et.al. | 2411.19261 | null |
2024-11-28 | SOWing Information: Cultivating Contextual Coherence with MLLMs in Image Generation | Yuhan Pei et.al. | 2411.19182 | null |
2024-11-27 | Diffusion Self-Distillation for Zero-Shot Customized Image Generation | Shengqu Cai et.al. | 2411.18616 | null |
2024-11-27 | FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion | Haosen Yang et.al. | 2411.18552 | null |
2024-11-27 | TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models | Riza Velioglu et.al. | 2411.18350 | link |
2024-11-27 | Prediction with Action: Visual Policy Learning via Joint Denoising Process | Yanjiang Guo et.al. | 2411.18179 | null |
2024-11-27 | Type-R: Automatically Retouching Typos for Text-to-Image Generation | Wataru Shimoda et.al. | 2411.18159 | null |
2024-11-27 | PersonaCraft: Personalized Full-Body Image Synthesis for Multiple Identities from Single References Using 3D-Model-Conditioned Diffusion | Gwanghyun Kim et.al. | 2411.18068 | null |
2024-11-27 | Exploring Visual Vulnerabilities via Multi-Loss Adversarial Search for Jailbreaking Vision-Language Models | Shuyang Hao et.al. | 2411.18000 | null |
2024-11-26 | Generative Image Layer Decomposition with Visual Effects | Jinrui Yang et.al. | 2411.17864 | null |
2024-11-27 | Diffusion Autoencoders for Few-shot Image Generation in Hyperbolic Space | Lingxiao Li et.al. | 2411.17784 | null |
2024-11-26 | An Ensemble Approach for Brain Tumor Segmentation and Synthesis | Juampablo E. Heras Rivera et.al. | [2411.17617](http://arxiv.org/abs/2411.17 |