AI千集
首页热点资讯知识笔记阅读频道关于我们知识库客户端下载
登录
  • 最新
  • 推荐
  • AI医疗
  • AI写作
  • AI应用
  • AI基础
  • AI量化
  • AI数据
  • AI画图
  • AIGC
  • 编程基础
  • 课程演讲
  • 资料仓库
  • AI写作文:(cat,girl,robot)

    gingo 2023-08-23 20:43:59 AIGC
    0 / 2229
  • AI写作文:(cat, hat, mouse)

    gingo 2023-08-23 20:39:11 AIGC
    0 / 2167
  • AI写作文:(dog,ball,water)

    gingo 2023-08-23 20:36:40 AIGC
    0 / 2155
  • 后端学习值得收藏的一些开源电子书

    openoker 2023-08-23 15:52:14 资料仓库 电子书
    0 / 2148
  • Taro(React+TS)基于InnerAudioContext封装一个基本的音频组件

    openoker 2023-08-21 17:54:42 资料仓库 Taro前端
    0 / 2254
  • AIGC网站汇总

    gingo 2023-08-09 22:01:54 AI应用 AI应用
    0 / 2751
  • 怎么训出穿衣服脱衣服的lora

    weilaiweiding 2023-08-04 15:06:35 资料仓库 Lora
    1
    0 / 3350
  • 训练cosplay服饰lora(转载)

    weilaiweiding 2023-08-04 14:58:48 AI画图
    0 / 3047
  • 语音转语音技术之Retrieval-based-Voice-Conversion-WebUI

    openoker 2023-08-03 19:47:25 AIGC 音乐生成
    0 / 2268
  • 面向开发人员的 ChatGPT 提示工程课程(限时免费 吴恩达 )

    weilaiweiding 2023-07-28 17:35:29 课程演讲
    0 / 2322
  • 猫猫什么的最可爱了

    weilaiweiding 2023-07-28 16:56:18 AI画图
    0 / 2043
  • 可爱旗袍女孩

    weilaiweiding 2023-07-28 16:49:41 AI画图
    0 / 2190
  • Tags基本编写逻辑及三段术式入门与解析v3(zz)

    gingo 2023-07-28 14:36:51 AI画图
    0 / 3756
  • 儿童版:人工智能到底是什么

    gingo 2023-07-28 11:47:22 AI基础
    0 / 1948
  • 算法工程师技术路线

    gingo 2023-07-28 11:16:38 课程演讲
    0 / 2077
  • 学习常用Prompt

    openoker 2023-07-03 10:06:53 AIGC 大模型提示词
    0 / 2615
  • 【史上最全AI作图入手教程】:打造AI作图studio之工具使用(zz)

    gingo 2023-06-03 16:42:42 AI应用 AI绘画
    0 / 2638
  • JavaScript工具函数

    openoker 2023-05-09 10:11:34 编程基础 javascript
    0 / 1974
  • python异步编程之asyncio(百万并发)

    openoker 2023-05-04 16:56:44 编程基础 python
    0 / 1984
  • 利用chatgpt生成指令集

    openoker 2023-05-04 15:51:50 资料仓库 指令集
    0 / 2613
  • 寻找那些ChatGPT/GPT4开源“平替”们

    openoker 2023-04-29 10:50:49 资料仓库 chatgpt
    0 / 2175
  • Improving Diffusers Package for High-Quality Image Generation

    gingo 2023-04-23 15:34:12 AI画图
    0 / 2237
  • 大模型训练流程

    openoker 2023-04-18 10:59:36 资料仓库 深度学习大模型
    1
    0 / 2349
  • chatgpt的训练数据集

    openoker 2023-04-17 11:41:57 资料仓库 数据集
    0 / 1989
  • 大模型微调代码解析,哪些方法可以加速训练?

    openoker 2023-04-06 15:53:15 资料仓库 大模型
    0 / 2095
  • 在SD中,如何通过cutoff插件控制latent来给指定部分颜色(给人换衣服)

    gingo 2023-04-03 14:24:14 AI画图
    0 / 3542
  • ChatGPT复现之路

    openoker 2023-03-27 11:37:09 资料仓库 chatgpt大模型
    0 / 2261
  • NLP数据集整理

    openoker 2023-03-09 16:42:12 AI数据 数据集
    0 / 2027
  • python实现过滤敏感词

    openoker 2023-03-07 14:28:46 资料仓库 python经典算法
    0 / 1989
  • T5 PEGASUS:开源一个中文生成式预训练模型

    openoker 2023-02-23 15:41:41 资料仓库 NLP
    0 / 2199
  • FastAPI教程

    openoker 2023-02-20 16:47:35 编程基础 python
    0 / 2189
  • ChatGPT笔记

    openoker 2023-02-16 17:04:25 资料仓库 chatgpt
    0 / 2540
  • 如何微调GPT-J-基础知识

    openoker 2023-02-16 11:51:15 资料仓库 GPTGPTJ
    0 / 1710
  • 在Google Colab或同等配置GPU服务器上微调GPT-J-6B

    openoker 2023-02-15 17:32:24 资料仓库 GPTJ
    0 / 1974
  • WSL2 网络异常排查

    openoker 2023-02-14 10:10:02 编程基础 WSL2Windows
    0 / 2182
  • Stable Diffusion 资料

    gingo 2023-02-04 20:07:09 AI画图
    0 / 2129
  • Celery与FastAPI实现分布式异步任务队列

    openoker 2023-02-02 16:19:28 资料仓库 FastAPICelery
    1
    0 / 1982
  • 使用 Diffusers 通过 DreamBooth 来训练 Stable Diffusion

    gingo 2023-01-30 23:57:50 AI画图 图生图扩散模型
    0 / 1980
  • 2022生成模型进展盘点:9类生成模型代表作

    gingo 2023-01-30 23:17:50 资料仓库
    0 / 1713
  • 【NovelAI绘图教程】如何更好的模仿你喜欢的画师的画风——Hypernetworks模型的训练

    openoker 2022-12-15 10:58:56 AI画图 NovelAIPrompt
    1
    0 / 2979
  • windows安装Aria2教程

    openoker 2022-12-10 21:30:59 编程基础 下载BT下载
    0 / 1751
  • SecStudent :全场景视频流隐私增强算法

    openoker 2022-11-29 15:57:31 AI应用 视频工具GAN
    0 / 1910
  • 基于tensorflow2.0+使用bert获取中文词、句向量并进行相似度分析

    openoker 2022-11-15 14:53:17 资料仓库 TensorflowBert
    0 / 1788
  • 最新一些论文进展和Google的大招

    gingo 2022-11-09 16:30:26 资料仓库 Google论文
    0 / 1890
  • Novel AI水魔法效果--元素法典 The Code of Quintessence

    weilaiweiding 2022-11-05 17:11:27 AI画图
    0 / 2664
  • What are Diffusion Models?什么是Diffusion Model(扩散模型)

    gingo 2022-11-05 10:38:50 资料仓库 生成网络
    0 / 2912
  • MySQL 超大表快速删除

    openoker 2022-09-20 15:08:37 AI数据 mysql
    0 / 1839
  • 大陆市场这么大,却造不出来一块好显卡……

    openoker 2022-09-01 00:02:02 编程基础 gpu芯片
    1
    0 / 1626
  • Linux 网络工具详解之 ip tuntap 和 tunctl 创建 tap/tun 设备

    openoker 2022-08-17 17:02:56 编程基础 网络
    0 / 2047
  • AI又要进化了,lecun努力将符号推理和深度学习推动到混合模型

    gingo 2022-08-16 15:56:45 课程演讲
    0 / 2108
上一页 下一页
  • 1
  • 2
  • 3
  • 4
  • 5
签到
立即签到
签到可以获得积分哦!
公告

AI千集是一个专注于科研服务的智能平台
在这里您可以获得本平台自训练的
科研智能体
和小伙伴一起玩转AI,做自己的AI机器人
来AI千集,赋能智慧快人一步
扫一扫,快速获取解决方案与报价
立即咨询

最新资讯更多
  • Deep Reinforcement learning (DRL) has achieved remarkable success in domains with well-defined reward structures, such as Atari games and locomotion. In contrast, dexterous manipulation lacks general-purpose reward formulations and typically depends on task-specific, handcrafted priors to guide hand-object interactions. We propose Contact Coverage-Guided Exploration (CCGE), a general exploration method designed for general-purpose dexterous manipulation tasks. CCGE represents contact state as the intersection between object surface points and predefined hand keypoints, encouraging dexterous hands to discover diverse and novel contact patterns, namely which fingers contact which object regions. It maintains a contact counter conditioned on discretized object states obtained via learned hash codes, capturing how frequently each finger interacts with different object regions. This counter is leveraged in two complementary ways:(1) to assign a count-based contact coverage reward that promotes exploration of novel contact patterns, and (2) an energy-based reaching reward that guides the agent toward under-explored contact regions. We evaluate CCGE on a diverse set of dexterous manipulation tasks, including cluttered object singulation, constrained object retrieval, in-hand reorientation, and bimanual manipulation. Experimental results show that CCGE substantially improves training efficiency and success rates over existing exploration methods, and that the contact patterns learned with CCGE transfer robustly to real-world robotic systems. Project page is
    喜欢 0    评论 0
  • Vision Language Models (VLMs) exhibit persistent hallucinations in counting tasks, with accuracy substantially lower than other visual reasoning tasks (excluding sentiment). This phenomenon persists even in state-of-the-art reasoning-capable VLMs. Conversely, CNN-based object detection models (ODMs) such as YOLO excel at spatial localization and instance counting with minimal computational overhead. We propose GroundCount, a framework that augments VLMs with explicit spatial grounding from ODMs to mitigate counting hallucinations. In the best case, our prompt-based augmentation strategy achieves 81.3% counting accuracy on the best-performing model (Ovis2.5-2B) - a 6.6pp improvement - while reducing inference time by 22% through elimination of hallucination-driven reasoning loops for stronger models. We conduct comprehensive ablation studies demonstrating that positional encoding is a critical component, being beneficial for stronger models but detrimental for weaker ones. Confidence scores, by contrast, introduce noise for most architectures and their removal improves performance in four of five evaluated models. We further evaluate feature-level fusion architectures, finding that explicit symbolic grounding via structured prompts outperforms implicit feature fusion despite sophisticated cross-attention mechanisms. Our approach yields consistent improvements across four of five evaluated VLM architectures (6.2--7.5pp), with one architecture exhibiting degraded performance due to incompatibility between its iterative reflection mechanisms and structured prompts. These results suggest that counting failures stem from fundamental spatial-semantic integration limitations rather than architecture-specific deficiencies, while highlighting the importance of architectural compatibility in augmentation strategies.
    喜欢 0    评论 0
  • The rapid evolution and inherent complexity of modern software requirements demand highly flexible and responsive development methodologies. While Agile frameworks have become the industry standard for prioritizing iteration, collaboration, and adaptability, software development teams continue to face persistent challenges in managing constantly evolving requirements and maintaining product quality under tight deadlines. This article explores the intersection of Artificial Intelligence (AI) and Software Engineering (SE), to analyze how AI serves as a powerful catalyst for enhancing agility and fostering innovation. The research combines a comprehensive review of existing literature with an empirical study, utilizing a survey directed at Software Engineering professionals to assess the perception, adoption, and impact of AI-driven tools. Key findings reveal that the integration of AI (specifically through Machine Learning (ML) and Natural Language Processing (NLP) )facilitates the automation of tedious tasks, from requirement management to code generation and testing . This paper demonstrates that AI not only optimizes current Agile practices but also introduces new capabilities essential for sustaining quality, speed, and innovation in the future landscape of software development.
    喜欢 0    评论 0
  • Human uplift studies - or studies that measure AI effects on human performance relative to a status quo, typically using randomized controlled trial (RCT) methodology - are increasingly used to inform deployment, governance, and safety decisions for frontier AI systems. While the methods underlying these studies are well-established, their interaction with the distinctive properties of frontier AI systems remains underexamined, particularly when results are used to inform high-stakes decisions. We present findings from interviews with 16 expert practitioners with experience conducting human uplift studies in domains including biosecurity, cybersecurity, education, and labor. Across interviews, experts described a recurring tension between standard causal inference assumptions and the object of study itself. Rapidly evolving AI systems, shifting baselines, heterogeneous and changing user proficiency, and porous real-world settings strain assumptions underlying internal, external, and construct validity, complicating the interpretation and appropriate use of uplift evidence. We synthesize these challenges across key stages of the human uplift research lifecycle and map them to practitioner-reported solutions, clarifying both the limits and the appropriate uses of evidence from human uplift studies in high-stakes decision-making.
    喜欢 0    评论 0
  • VLMs have become increasingly proficient at a range of computer vision tasks, such as visual question answering and object detection. This includes increasingly strong capabilities in the domain of art, from analyzing artwork to generation of art. In an interdisciplinary collaboration between computer scientists and art historians, we characterize the mechanisms underlying VLMs' ability to predict artistic style and assess the extent to which they align with the criteria art historians use to reason about artistic style. We employ a latent-space decomposition approach to identify concepts that drive art style prediction and conduct quantitative evaluations, causal analysis and assessment by art historians. Our findings indicate that 73% of the extracted concepts are judged by art historians to exhibit a coherent and semantically meaningful visual feature and 90% of concepts used to predict style of a given artwork were judged relevant. In cases where an irrelevant concept was used to successfully predict style, art historians identified possible reasons for its success; for example, the model might "understand" a concept in more formal terms, such as dark/light contrasts.
    喜欢 0    评论 0
  • We present IsalGraph, a method for representing the structure of any finite, simple graph as a compact string over a nine-character instruction alphabet. The encoding is executed by a small virtual machine comprising a sparse graph, a circular doubly-linked list (CDLL) of graph-node references, and two traversal pointers. Instructions either move a pointer through the CDLL or insert a node or edge into the graph. A key design property is that every string over the alphabet decodes to a valid graph, with no invalid states reachable. A greedy \emph{GraphToString} algorithm encodes any connected graph into a string in time polynomial in the number of nodes; an exhaustive-backtracking variant produces a canonical string by selecting the lexicographically smallest shortest string across all starting nodes and all valid traversal orders. We evaluate the representation on five real-world graph benchmark datasets (IAM Letter LOW/MED/HIGH, LINUX, and AIDS) and show that the Levenshtein distance between IsalGraph strings correlates strongly with graph edit distance (GED). Together, these properties make IsalGraph strings a compact, isomorphism-invariant, and language-model-compatible sequential encoding of graph structure, with direct applications in graph similarity search, graph generation, and graph-conditioned language modelling
    喜欢 0    评论 0
积分排行
  • gingo

    104 帖子 • 8 评论

    1431
  • 137****0631

    0 帖子 • 0 评论

    997
  • weilaiweiding

    52 帖子 • 1 评论

    821
  • icodebase

    59 帖子 • 1 评论

    528
  • openoker

    171 帖子 • 23 评论

    298
  • 188****2791

    0 帖子 • 0 评论

    180
  • boatingman

    0 帖子 • 1 评论

    90
  • toc

    0 帖子 • 0 评论

    60
  • Van

    3 帖子 • 0 评论

    42
  • shadow

    5 帖子 • 0 评论

    28
友情链接 查看更多>>
  • 导航

    打造最强静态导航网站

  • 135AI排版

    公众号智能文案生成与自动排版工具

  • 二次元数字人视频生成平台

    二次元数字人视频生成平台

关于 标签 友链
粤ICP备18152112号 网信算备330110507206401230035号 粤公网安备44030302001590号 © 2018-2025 AI千集 All Rights Reserved