智能论文笔记

Simultaneous Location of Rail Vehicles and Mapping of Environment with Multiple LiDARs

Yusheng Wang , Weiwei Song , Yidong Lou , Fei Huang , Zhiyong Tu , Shimin Zhang

分类：机器人

2021-12-25

精确和实时轨道车辆本地化以及铁路环境监测对于铁路安全至关重要。在这封信中，我们提出了一种基于多激光器的同时定位和映射（SLAM）系统，用于铁路应用。我们的方法从测量开始预处理，以便去噪并同步多个LIDAR输入。根据LIDAR放置使用不同的帧到框架注册方法。此外，我们利用来自提取的轨道轨道的平面约束来提高系统精度。本地地图进一步与利用绝对位置测量的全局地图对齐。考虑到不可避免的金属磨损和螺杆松动，在手术期间唤醒了在线外在细化。在收集3000公里的数据集上广泛验证了所提出的方法。结果表明，所提出的系统与大规模环境的有效映射一起实现了精确且稳健的本地化。我们的系统已应用于运费交通铁路以监控任务。

translated by 谷歌翻译

A-ESRGAN: Training Real-World Blind Super-Resolution with Attention U-Net Discriminators

Zihao Wei , Yidong Huang , Yuang Chen , Chenhao Zheng , Jinnan Gao

分类：计算机视觉 | 机器学习

2021-12-19

盲目图像超分辨率（SR）是CV的长期任务，旨在恢复患有未知和复杂扭曲的低分辨率图像。最近的工作主要集中在采用更复杂的退化模型来模拟真实世界的降级。由此产生的模型在感知损失和产量感知令人信服的结果取得了突破性。然而，电流生成的对抗性网络结构所带来的限制仍然是显着的：处理像素同样地导致图像的结构特征的无知，并且导致性能缺点，例如扭曲线和背景过度锐化或模糊。在本文中，我们提出了A-ESRAN，用于盲人SR任务的GAN模型，其特色是基于U-NET的U-NET的多尺度鉴别器，可以与其他发电机无缝集成。据我们所知，这是第一项介绍U-Net结构作为GaN解决盲人问题的鉴别者的工作。本文还给出了对模型的多规模注意力突破的机制的解释。通过对现有作品的比较实验，我们的模型在非参考自然图像质量评估员度量上提出了最先进的水平性能。我们的消融研究表明，利用我们的鉴别器，基于RRDB的发电机可以利用多种尺度中图像的结构特征，因此与先前作品相比，更加感知地产生了感知的高分辨率图像。

translated by 谷歌翻译

Rail Vehicle Localization and Mapping with LiDAR-Vision-Inertial-GNSS Fusion

Yusheng Wang , Weiwei Song , Yidong Lou , Yi Zhang , Fei Huang , Zhiyong Tu , Qiangsheng Liang

分类：机器人

2021-12-16

在本文中，我们介绍了全球导航卫星系统（GNSS）辅助激光乐队 - 视觉惯性方案RAILTOMER-V，用于准确且坚固的铁路车辆本地化和映射。 Raillomer-V在因子图上制定，由两个子系统组成：辅助LiDar惯性系统（OLIS）和距离的内径综合视觉惯性系统（OVI）。两个子系统都利用了铁路上的典型几何结构。提取的轨道轨道的平面约束用于补充OLI中的旋转和垂直误差。此外，线特征和消失点被利用以限制卵巢中的旋转漂移。拟议的框架在800公里的数据集中广泛评估，聚集在一年以上的一般速度和高速铁路，日夜。利用各个传感器的所有测量的紧密耦合集成，我们的框架准确到了长期的任务，并且足够强大地避免了退行的情景（铁路隧道）。此外，可以使用车载计算机实现实时性能。

translated by 谷歌翻译

RailLoMer: Rail Vehicle Localization and Mapping with LiDAR-IMU-Odometer-GNSS Data Fusion

Yusheng Wang , Yidong Lou , Yi Zhang , Weiwei Song , Fei Huang , Zhiyong Tu , Shimin Zhang

分类：机器人

2021-11-30

我们在本文中介绍Raillomer，实现实时准确和鲁棒的内径测量和轨道车辆的测绘。 Raillomer从两个Lidars，IMU，火车车程和全球导航卫星系统（GNSS）接收器接收测量。作为前端，来自IMU / Royomer缩放组的估计动作De-Skews DeSoised Point云并为框架到框架激光轨道测量产生初始猜测。作为后端，配制了基于滑动窗口的因子图以共同优化多模态信息。另外，我们利用来自提取的轨道轨道和结构外观描述符的平面约束，以进一步改善对重复结构的系统鲁棒性。为了确保全局常见和更少的模糊映射结果，我们开发了一种两级映射方法，首先以本地刻度执行扫描到地图，然后利用GNSS信息来注册模块。该方法在聚集的数据集上广泛评估了多次范围内的数据集，并且表明Raillomer即使在大或退化的环境中也能提供排入量级定位精度。我们还将Raillomer集成到互动列车状态和铁路监控系统原型设计中，已经部署到实验货量交通铁路。

translated by 谷歌翻译

MetroLoc: Metro Vehicle Mapping and Localization with LiDAR-Camera-Inertial Integration

Yusheng Wang , Weiwei Song , Yi Zhang , Fei Huang , Zhiyong Tu , Yidong Lou

分类：机器人

2021-11-01

我们提出了一种准确而坚固的多模态传感器融合框架，Metroloc，朝着最极端的场景之一，大规模地铁车辆本地化和映射。 Metroloc在以IMU为中心的状态估计器上构建，以较轻耦合的方法紧密地耦合光检测和测距（LIDAR），视觉和惯性信息。所提出的框架由三个子模块组成：IMU Odometry，LiDar - 惯性内径术（LIO）和视觉惯性内径（VIO）。 IMU被视为主要传感器，从LIO和VIO实现了从LIO和VIO的观察，以限制加速度计和陀螺仪偏差。与以前的点LIO方法相比，我们的方法通过将线路和平面特征引入运动估计来利用更多几何信息。 VIO还通过使用两条线和点来利用环境结构信息。我们所提出的方法在具有维护车辆的长期地铁环境中广泛测试。实验结果表明，该系统比使用实时性能的最先进的方法更准确和强大。此外，我们开发了一系列虚拟现实（VR）应用，以实现高效，经济，互动的轨道车辆状态和轨道基础设施监控，已经部署到室外测试铁路。

translated by 谷歌翻译

Efficient Bayesian Uncertainty Estimation for nnU-Net

Yidong Zhao , Changchun Yang , Artur Schweidtmann , Qian Tao

分类：计算机视觉 | 人工智能

2022-12-12

The self-configuring nnU-Net has achieved leading performance in a large range of medical image segmentation challenges. It is widely considered as the model of choice and a strong baseline for medical image segmentation. However, despite its extraordinary performance, nnU-Net does not supply a measure of uncertainty to indicate its possible failure. This can be problematic for large-scale image segmentation applications, where data are heterogeneous and nnU-Net may fail without notice. In this work, we introduce a novel method to estimate nnU-Net uncertainty for medical image segmentation. We propose a highly effective scheme for posterior sampling of weight space for Bayesian uncertainty estimation. Different from previous baseline methods such as Monte Carlo Dropout and mean-field Bayesian Neural Networks, our proposed method does not require a variational architecture and keeps the original nnU-Net architecture intact, thereby preserving its excellent performance and ease of use. Additionally, we boost the segmentation performance over the original nnU-Net via marginalizing multi-modal posterior models. We applied our method on the public ACDC and M&M datasets of cardiac MRI and demonstrated improved uncertainty estimation over a range of baseline methods. The proposed method further strengthens nnU-Net for medical image segmentation in terms of both segmentation accuracy and quality control.

translated by 谷歌翻译

MOPRD: A multidisciplinary open peer review dataset

Jialiang Lin , Jiaxin Song , Zhangping Zhou , Yidong Chen , Xiaodong Shi

分类：人工智能 | 自然语言处理 | 机器学习

2022-12-09

Open peer review is a growing trend in academic publications. Public access to peer review data can benefit both the academic and publishing communities. It also serves as a great support to studies on review comment generation and further to the realization of automated scholarly paper review. However, most of the existing peer review datasets do not provide data that cover the whole peer review process. Apart from this, their data are not diversified enough as they are mainly collected from the field of computer science. These two drawbacks of the currently available peer review datasets need to be addressed to unlock more opportunities for related studies. In response to this problem, we construct MOPRD, a multidisciplinary open peer review dataset. This dataset consists of paper metadata, multiple version manuscripts, review comments, meta-reviews, author's rebuttal letters, and editorial decisions. Moreover, we design a modular guided review comment generation method based on MOPRD. Experiments show that our method delivers better performance indicated by both automatic metrics and human evaluation. We also explore other potential applications of MOPRD, including meta-review generation, editorial decision prediction, author rebuttal generation, and scientometric analysis. MOPRD is a strong endorsement for further studies in peer review-related research and other applications.

translated by 谷歌翻译

GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective

Linyi Yang , Shuibai Zhang , Libo Qin , Yafu Li , Yidong Wang , Hanmeng Liu , Jindong Wang , Xing Xie , Yue Zhang

分类：自然语言处理 | 人工智能 | 机器学习

2022-11-15

Pre-trained language models (PLMs) are known to improve the generalization performance of natural language understanding models by leveraging large amounts of data during the pre-training phase. However, the out-of-distribution (OOD) generalization problem remains a challenge in many NLP tasks, limiting the real-world deployment of these methods. This paper presents the first attempt at creating a unified benchmark named GLUE-X for evaluating OOD robustness in NLP models, highlighting the importance of OOD robustness and providing insights on how to measure the robustness of a model and how to improve it. The benchmark includes 13 publicly available datasets for OOD testing, and evaluations are conducted on 8 classic NLP tasks over 19 popularly used PLMs. Our findings confirm the need for improved OOD accuracy in NLP tasks, as significant performance degradation was observed in all settings compared to in-distribution (ID) accuracy.

translated by 谷歌翻译

Automatic Analysis of Available Source Code of Top Artificial Intelligence Conference Papers

Jialiang Lin , Yingmin Wang , Yao Yu , Yu Zhou , Yidong Chen , Xiaodong Shi

分类：人工智能 | 自然语言处理 | 机器学习

2022-09-28

源代码对于研究人员重现方法并复制人工智能（AI）论文的结果至关重要。一些组织和研究人员手动收集具有可用源代码的AI论文，以对AI社区做出贡献。但是，手动收集是一项劳动密集型且耗时的任务。为了解决此问题，我们提出了一种方法，可以自动识别具有可用源代码的论文并提取其源代码存储库URL。通过这种方法，我们发现，从2010年到2019年发布的10个最高AI会议的常规论文中有20.5％被确定为具有可用源代码的论文，并且这些源代码存储库中有8.1％不再可访问。我们还创建了XMU NLP Lab ReadMe数据集，这是用于源代码文档研究的标记已读数文件的最大数据集。通过此数据集，我们发现了很多读书文件没有提供的安装说明或使用教程。此外，对AI会议论文的源代码的一般图片进行了大规模的综合统计分析。提出的解决方案还可以超越AI会议论文，以分析来自期刊和会议的其他科学论文，以阐明更多领域。

translated by 谷歌翻译

Towards Optimization and Model Selection for Domain Generalization: A Mixup-guided Solution

Wang Lu , Jindong Wang , Yidong Wang , Kan Ren , Yiqiang Chen , Xing Xie

分类：机器学习

2022-09-01

培训和测试数据之间的分布变化通常会破坏深度学习模型的性能。近年来，许多工作都注意存在分布转移的领域泛化（DG），而目标数据看不见。尽管算法设计取得了进展，但长期以来一直忽略了两个基础因素：1）基于正则化的目标（例如，分布对齐）的优化和2）DG的模型选择，因为无法利用有关目标域的知识。在本文中，我们提出了用于域概括的优化和选择技术的混合。为了进行优化，我们利用改编的混音来生成一个分发数据集，该数据集可以指导首选项方向并通过帕累托优化进行优化。对于模型选择，我们生成一个验证数据集，距离目标分布距离更遥远，从而可以更好地表示目标数据。我们还提出了一些理论见解。对一个视觉分类基准和三个时间序列基准的全面实验表明，我们的模型优化和选择技术可以在很大程度上可以改善现有域概括算法的性能，甚至可以取得新的最先进的结果。

translated by 谷歌翻译