智能论文笔记

AdvDO: Realistic Adversarial Attacks for Trajectory Prediction

Yulong Cao , Chaowei Xiao , Anima Anandkumar , Danfei Xu , Marco Pavone

分类：机器学习 | 人工智能

2022-09-19

轨迹预测对于自动驾驶汽车（AV）是必不可少的，以计划正确且安全的驾驶行为。尽管许多先前的作品旨在达到更高的预测准确性，但很少有人研究其方法的对抗性鲁棒性。为了弥合这一差距，我们建议研究数据驱动的轨迹预测系统的对抗性鲁棒性。我们设计了一个基于优化的对抗攻击框架，该框架利用精心设计的可区分动态模型来生成逼真的对抗轨迹。从经验上讲，我们基于最先进的预测模型的对抗性鲁棒性，并表明我们的攻击使通用指标和计划感知指标的预测错误增加了50％以上和37％。我们还表明，我们的攻击可以导致AV在模拟中驶离道路或碰撞到其他车辆中。最后，我们演示了如何使用对抗训练计划来减轻对抗性攻击。

translated by 谷歌翻译

Robust Trajectory Prediction against Adversarial Attacks

Yulong Cao , Danfei Xu , Xinshuo Weng , Zhuoqing Mao , Anima Anandkumar , Chaowei Xiao , Marco Pavone

分类：机器学习 | 人工智能 | 计算机视觉

2022-07-29

使用深神经网络（DNN）的轨迹预测是自主驾驶（AD）系统的重要组成部分。但是，这些方法容易受到对抗攻击的影响，从而导致严重的后果，例如碰撞。在这项工作中，我们确定了两种关键要素，以捍卫轨迹预测模型，以防止（1）设计有效的对抗训练方法，以及（2）添加特定领域的数据增强以减轻清洁数据的性能降低。我们证明，与经过干净数据训练的模型相比，我们的方法能够在对抗数据上的性能提高46％，而在干净数据上只有3％的性能退化。此外，与现有的强大方法相比，我们的方法可以在对抗性示例中提高21％的性能，而在清洁数据上可以提高9％。我们的健壮模型与计划者一起评估，以研究其下游影响。我们证明我们的模型可以大大降低严重的事故率（例如碰撞和越野驾驶）。

translated by 谷歌翻译

Generating Useful Accident-Prone Driving Scenarios via a Learned Traffic Prior

Davis Rempe , Jonah Philion , Leonidas J. Guibas , Sanja Fidler , Or Litany

分类：计算机视觉 | 机器学习 | 机器人

2021-12-09

自治车辆的评估和改善规划需要可扩展的长尾交通方案。有用的是，这些情景必须是现实的和挑战性的，但不能安全地开车。在这项工作中，我们介绍努力，一种自动生成具有挑战性的场景的方法，导致给定的计划者产生不良行为，如冲突。为了维护情景合理性，关键的想法是利用基于图形的条件VAE的形式利用学习的交通运动模型。方案生成在该流量模型的潜在空间中制定了优化，通过扰乱初始的真实世界的场景来产生与给定计划者碰撞的轨迹。随后的优化用于找到“解决方案”的场景，确保改进给定的计划者是有用的。进一步的分析基于碰撞类型的群集生成的场景。我们攻击两名策划者并展示争取在这两种情况下成功地产生了现实，具有挑战性的情景。我们另外“关闭循环”并使用这些方案优化基于规则的策划器的超参数。

translated by 谷歌翻译

On Adversarial Robustness of Trajectory Prediction for Autonomous Vehicles

Qingzhao Zhang , Shengtuo Hu , Jiachen Sun , Qi Alfred Chen , Z. Morley Mao

分类：计算机视觉

2022-01-13

轨迹预测是自动车辆（AVS）执行安全规划和导航的关键组件。然而，很少有研究分析了轨迹预测的对抗性稳健性，或者调查了最坏情况的预测是否仍然可以导致安全规划。为了弥合这种差距，我们通过提出普通车辆轨迹来最大化预测误差来研究轨迹预测模型的对抗鲁棒性。我们在三个模型和三个数据集上的实验表明，对手预测将预测误差增加超过150％。我们的案例研究表明，如果对手在对手轨迹之后驱动靠近目标AV的车辆，则AV可以进行不准确的预测，甚至不安全的驾驶决策。我们还通过数据增强和轨迹平滑探索可能的缓解技术。

translated by 谷歌翻译

Differentiable Integrated Motion Prediction and Planning with Learnable Cost Function for Autonomous Driving

Zhiyu Huang , Haochen Liu , Jingda Wu , Chen Lv

分类：机器人

2022-07-21

相应地预测周围交通参与者的未来状态，并计划安全，平稳且符合社会的轨迹对于自动驾驶汽车至关重要。当前的自主驾驶系统有两个主要问题：预测模块通常与计划模块解耦，并且计划的成本功能很难指定和调整。为了解决这些问题，我们提出了一个端到端的可区分框架，该框架集成了预测和计划模块，并能够从数据中学习成本函数。具体而言，我们采用可区分的非线性优化器作为运动计划者，该运动计划将神经网络给出的周围剂的预测轨迹作为输入，并优化了自动驾驶汽车的轨迹，从而使框架中的所有操作都可以在框架中具有可观的成本，包括成本功能权重。提出的框架经过大规模的现实驾驶数据集进行了训练，以模仿整个驾驶场景中的人类驾驶轨迹，并在开环和闭环界面中进行了验证。开环测试结果表明，所提出的方法的表现优于各种指标的基线方法，并提供以计划为中心的预测结果，从而使计划模块能够输出接近人类的轨迹。在闭环测试中，提出的方法表明能够处理复杂的城市驾驶场景和鲁棒性，以抵抗模仿学习方法所遭受的分配转移。重要的是，我们发现计划和预测模块的联合培训比在开环和闭环测试中使用单独的训练有素的预测模块进行计划要比计划更好。此外，消融研究表明，框架中的可学习组件对于确保计划稳定性和性能至关重要。

translated by 谷歌翻译

DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles

Peter Karkus , Boris Ivanovic , Shie Mannor , Marco Pavone

分类：机器人 | 人工智能 | 计算机视觉 | 机器学习

2022-12-13

Autonomous vehicle (AV) stacks are typically built in a modular fashion, with explicit components performing detection, tracking, prediction, planning, control, etc. While modularity improves reusability, interpretability, and generalizability, it also suffers from compounding errors, information bottlenecks, and integration challenges. To overcome these challenges, a prominent approach is to convert the AV stack into an end-to-end neural network and train it with data. While such approaches have achieved impressive results, they typically lack interpretability and reusability, and they eschew principled analytical components, such as planning and control, in favor of deep neural networks. To enable the joint optimization of AV stacks while retaining modularity, we present DiffStack, a differentiable and modular stack for prediction, planning, and control. Crucially, our model-based planning and control algorithms leverage recent advancements in differentiable optimization to produce gradients, enabling optimization of upstream components, such as prediction, via backpropagation through planning and control. Our results on the nuScenes dataset indicate that end-to-end training with DiffStack yields substantial improvements in open-loop and closed-loop planning metrics by, e.g., learning to make fewer prediction errors that would affect planning. Beyond these immediate benefits, DiffStack opens up new opportunities for fully data-driven yet modular and interpretable AV architectures. Project website: https://sites.google.com/view/diffstack

translated by 谷歌翻译

AdvSim: Generating Safety-Critical Scenarios for Self-Driving Vehicles

Jingkang Wang , Ava Pun , James Tu , Sivabalan Manivasagam , Abbas Sadat , Sergio Casas , Mengye Ren , Raquel Urtasun

分类：机器人 | 人工智能 | 计算机视觉 | 机器学习

2021-01-16

由于自动驾驶系统变得更好，模拟自动堆栈可能失败的方案变得更加重要。传统上，这些方案对于一些关于将地理演奏器状态作为输入的规划模块而产生的一些场景。这不会缩放，无法识别所有可能的自主义故障，例如由于遮挡引起的感知故障。在本文中，我们提出了对基于LIDAR的自治系统产生了安全性临界情景的促进框架。鉴于初始交通方案，Advsim以物理卓越的方式修改演员的轨迹，并更新LIDAR传感器数据以匹配扰动的世界。重要的是，通过直接模拟传感器数据，我们获得对完整自主堆栈的安全关键的对抗方案。我们的实验表明，我们的方法是一般的，可以识别成千上万的语义有意义的安全关键方案，适用于各种现代自动驾驶系统。此外，我们表明，通过使用Advsim产生的情景训练，可以进一步改善这些系统的稳健性和安全性。

translated by 谷歌翻译

BITS: Bi-level Imitation for Traffic Simulation

Danfei Xu , Yuxiao Chen , Boris Ivanovic , Marco Pavone

分类：机器人 | 机器学习

2022-08-26

仿真是对机器人系统（例如自动驾驶汽车）进行扩展验证和验证的关键。尽管高保真物理和传感器模拟取得了进步，但在模拟道路使用者的现实行为方面仍然存在一个危险的差距。这是因为，与模拟物理和图形不同，设计人类行为的第一个原理模型通常是不可行的。在这项工作中，我们采用了一种数据驱动的方法，并提出了一种可以学会从现实世界驱动日志中产生流量行为的方法。该方法通过将交通仿真问题分解为高级意图推理和低级驾驶行为模仿，通过利用驾驶行为的双层层次结构来实现高样本效率和行为多样性。该方法还结合了一个计划模块，以获得稳定的长马行为。我们从经验上验证了我们的方法，即交通模拟（位）的双层模仿，并具有来自两个大规模驾驶数据集的场景，并表明位表明，在现实主义，多样性和长途稳定性方面可以达到平衡的交通模拟性能。我们还探索了评估行为现实主义的方法，并引入了一套评估指标以进行交通模拟。最后，作为我们的核心贡献的一部分，我们开发和开源一个软件工具，该工具将跨不同驱动数据集的数据格式统一，并将现有数据集将场景转换为交互式仿真环境。有关其他信息和视频，请参见https://sites.google.com/view/nvr-bits2022/home

translated by 谷歌翻译

HTML版本

Vehicle trajectory prediction works, but not everywhere

Mohammadhossein Bahari , Saeed Saadatnejad , Ahmad Rahimi , Mohammad Shaverdikondori , Mohammad Shahidzadeh , Seyed-Mohsen Moosavi-Dezfooli , Alexandre Alahi

分类：计算机视觉

2021-12-07

现在是车辆轨迹预测是自动驾驶汽车的基本支柱。行业和研究社区都通过运行公共基准来承认这一柱的需求。而最先进的方法令人印象深刻，即，他们没有越野预测，他们对基准之外的城市的概括是未知的。在这项工作中，我们表明这些方法不会概括为新场景。我们提出了一种新颖的方法，可自动生成逼真的场景，导致最先进的模型越野。我们通过对抗场景生成的镜头来框架问题。我们推广基于原子场景生成功能的简单而有效的生成模型以及物理约束。我们的实验表明，可以在制作预测方法失败的方式中修改来自当前基准的超过60 000 \％$ 60 \％。我们进一步表明（i）生成的场景是现实的，因为它们确实存在于现实世界中，并且（ii）可用于使现有型号强大30-40％。代码可在https://sattack.github.io/处获得。

translated by 谷歌翻译

Conditional Predictive Behavior Planning with Inverse Reinforcement Learning for Human-like Autonomous Driving

Zhiyu Huang , Haochen Liu , Jingda Wu , Chen Lv

分类：机器人

2022-12-17

Making safe and human-like decisions is an essential capability of autonomous driving systems and learning-based behavior planning is a promising pathway toward this objective. Distinguished from existing learning-based methods that directly output decisions, this work introduces a predictive behavior planning framework that learns to predict and evaluate from human driving data. Concretely, a behavior generation module first produces a diverse set of candidate behaviors in the form of trajectory proposals. Then the proposed conditional motion prediction network is employed to forecast other agents' future trajectories conditioned on each trajectory proposal. Given the candidate plans and associated prediction results, we learn a scoring module to evaluate the plans using maximum entropy inverse reinforcement learning (IRL). We conduct comprehensive experiments to validate the proposed framework on a large-scale real-world urban driving dataset. The results reveal that the conditional prediction model is able to forecast multiple possible future trajectories given a candidate behavior and the prediction results are reactive to different plans. Moreover, the IRL-based scoring module can properly evaluate the trajectory proposals and select close-to-human ones. The proposed framework outperforms other baseline methods in terms of similarity to human driving trajectories. Moreover, we find that the conditional prediction model can improve both prediction and planning performance compared to the non-conditional model, and learning the scoring module is critical to correctly evaluating the candidate plans to align with human drivers.

translated by 谷歌翻译

Analyzing and Enhancing Closed-loop Stability in Reactive Simulation

Wei-Jer Chang , Yeping Hu , Chenran Li , Wei Zhan , Masayoshi Tomizuka

分类：机器人 | 人工智能 | 机器学习

2022-08-09

模拟在有效评估自动驾驶汽车方面发挥了重要作用。现有方法主要依赖于基于启发式的模拟，在该模拟中，交通参与者遵循某些无法产生复杂人类行为的人类编码的规则。因此，提出了反应性仿真概念，以通过利用现实世界数据来弥合模拟和现实世界交通情况之间的人类行为差距。但是，这些反应性模型可以在模拟几个步骤后轻松地产生不合理的行为，我们将模型视为失去其稳定性。据我们所知，没有任何工作明确讨论并分析了反应性仿真框架的稳定性。在本文中，我们旨在对反应性模拟进行彻底的稳定性分析，并提出一种增强稳定性的解决方案。具体而言，我们首先提出了一个新的反应模拟框架，在其中我们发现模拟状态序列的平滑度和一致性是稳定性的关键因素。然后，我们将运动学媒介物模型纳入框架中，以提高反应性模拟的闭环稳定性。此外，在本文中提出了一些新颖的指标，以更好地分析模拟性能。

translated by 谷歌翻译

nuPlan: A closed-loop ML-based planning benchmark for autonomous vehicles

Holger Caesar , Juraj Kabzan , Kok Seang Tan , Whye Kit Fong , Eric Wolff , Alex Lang , Luke Fletcher , Oscar Beijbom , Sammy Omari

分类：计算机视觉

2021-06-22

在这项工作中，我们提出了世界上第一个基于闭环ML的自动驾驶计划基准。虽然存在基于ML的ML的越来越多的ML的议员，但缺乏已建立的数据集和指标限制了该领域的进展。自主车辆运动预测的现有基准专注于短期运动预测，而不是长期规划。这导致了以前的作品来使用基于L2的度量标准的开放循环评估，这不适合公平地评估长期规划。我们的基准通过引入大规模驾驶数据集，轻量级闭环模拟器和特定于运动规划的指标来克服这些限制。我们提供高质量的数据集，在美国和亚洲的4个城市提供1500h的人类驾驶数据，具有广泛不同的交通模式（波士顿，匹兹堡，拉斯维加斯和新加坡）。我们将提供具有无功代理的闭环仿真框架，并提供一系列一般和方案特定的规划指标。我们计划在Neurips 2021上发布数据集，并在2022年初开始组织基准挑战。

translated by 谷歌翻译

Goal-oriented Autonomous Driving

Yihan Hu , Jiazhi Yang , Li Chen , Keyu Li , Chonghao Sima , Xizhou Zhu , Siqi Chai , Senyao Du , Tianwei Lin , Wenhai Wang

分类：计算机视觉 | 机器人

2022-12-20

Modern autonomous driving system is characterized as modular tasks in sequential order, i.e., perception, prediction and planning. As sensors and hardware get improved, there is trending popularity to devise a system that can perform a wide diversity of tasks to fulfill higher-level intelligence. Contemporary approaches resort to either deploying standalone models for individual tasks, or designing a multi-task paradigm with separate heads. These might suffer from accumulative error or negative transfer effect. Instead, we argue that a favorable algorithm framework should be devised and optimized in pursuit of the ultimate goal, i.e. planning of the self-driving-car. Oriented at this goal, we revisit the key components within perception and prediction. We analyze each module and prioritize the tasks hierarchically, such that all these tasks contribute to planning (the goal). To this end, we introduce Unified Autonomous Driving (UniAD), the first comprehensive framework up-to-date that incorporates full-stack driving tasks in one network. It is exquisitely devised to leverage advantages of each module, and provide complementary feature abstractions for agent interaction from a global perspective. Tasks are communicated with unified query design to facilitate each other toward planning. We instantiate UniAD on the challenging nuScenes benchmark. With extensive ablations, the effectiveness of using such a philosophy is proven to surpass previous state-of-the-arts by a large margin in all aspects. The full suite of codebase and models would be available to facilitate future research in the community.

translated by 谷歌翻译

RITA: Boost Autonomous Driving Simulators with Realistic Interactive Traffic Flow

Zhengbang Zhu , Shenyu Zhang , Yuzheng Zhuang , Yuecheng Liu , Minghuan Liu , Liyuan Mao , Ziqing Gong , Weinan Zhang , Shixiong Kai , Qiang Gu

分类：人工智能 | 机器人

2022-11-07

High-quality traffic flow generation is the core module in building simulators for autonomous driving. However, the majority of available simulators are incapable of replicating traffic patterns that accurately reflect the various features of real-world data while also simulating human-like reactive responses to the tested autopilot driving strategies. Taking one step forward to addressing such a problem, we propose Realistic Interactive TrAffic flow (RITA) as an integrated component of existing driving simulators to provide high-quality traffic flow for the evaluation and optimization of the tested driving strategies. RITA is developed with fidelity, diversity, and controllability in consideration, and consists of two core modules called RITABackend and RITAKit. RITABackend is built to support vehicle-wise control and provide traffic generation models from real-world datasets, while RITAKit is developed with easy-to-use interfaces for controllable traffic generation via RITABackend. We demonstrate RITA's capacity to create diversified and high-fidelity traffic simulations in several highly interactive highway scenarios. The experimental findings demonstrate that our produced RITA traffic flows meet all three design goals, hence enhancing the completeness of driving strategy evaluation. Moreover, we showcase the possibility for further improvement of baseline strategies through online fine-tuning with RITA traffic flows.

translated by 谷歌翻译

GET-DIPP: Graph-Embedded Transformer for Differentiable Integrated Prediction and Planning

Jiawei Sun , Chengran Yuan , Shuo Sun , Zhiyang Liu , Terence Goh , Anthony Wong , Keng Peng Tee , Marcelo H. Ang Jr

分类：机器人

2022-11-11

Accurately predicting interactive road agents' future trajectories and planning a socially compliant and human-like trajectory accordingly are important for autonomous vehicles. In this paper, we propose a planning-centric prediction neural network, which takes surrounding agents' historical states and map context information as input, and outputs the joint multi-modal prediction trajectories for surrounding agents, as well as a sequence of control commands for the ego vehicle by imitation learning. An agent-agent interaction module along the time axis is proposed in our network architecture to better comprehend the relationship among all the other intelligent agents on the road. To incorporate the map's topological information, a Dynamic Graph Convolutional Neural Network (DGCNN) is employed to process the road network topology. Besides, the whole architecture can serve as a backbone for the Differentiable Integrated motion Prediction with Planning (DIPP) method by providing accurate prediction results and initial planning commands. Experiments are conducted on real-world datasets to demonstrate the improvements made by our proposed method in both planning and prediction accuracy compared to the previous state-of-the-art methods.

translated by 谷歌翻译

Targeted Adversarial Attacks against Neural Network Trajectory Predictors

Kaiyuan Tan , Jun Wang , Yiannis Kantaros

分类：机器学习

2022-12-08

Trajectory prediction is an integral component of modern autonomous systems as it allows for envisioning future intentions of nearby moving agents. Due to the lack of other agents' dynamics and control policies, deep neural network (DNN) models are often employed for trajectory forecasting tasks. Although there exists an extensive literature on improving the accuracy of these models, there is a very limited number of works studying their robustness against adversarially crafted input trajectories. To bridge this gap, in this paper, we propose a targeted adversarial attack against DNN models for trajectory forecasting tasks. We call the proposed attack TA4TP for Targeted adversarial Attack for Trajectory Prediction. Our approach generates adversarial input trajectories that are capable of fooling DNN models into predicting user-specified target/desired trajectories. Our attack relies on solving a nonlinear constrained optimization problem where the objective function captures the deviation of the predicted trajectory from a target one while the constraints model physical requirements that the adversarial input should satisfy. The latter ensures that the inputs look natural and they are safe to execute (e.g., they are close to nominal inputs and away from obstacles). We demonstrate the effectiveness of TA4TP on two state-of-the-art DNN models and two datasets. To the best of our knowledge, we propose the first targeted adversarial attack against DNN models used for trajectory forecasting.

translated by 谷歌翻译

Long-Tail Prediction Uncertainty Aware Trajectory Planning for Self-driving Vehicles

Weitao Zhou , Zhong Cao , Nanshan Deng , Xiaoyu Liu , Kun Jiang , Diange Yang

分类：人工智能

2022-07-02

自主驾驶的典型轨迹计划通常依赖于预测周围障碍的未来行为。近年来，由于其令人印象深刻的性能，基于深度学习的预测模型已被广泛使用。但是，最近的研究表明，在长尾驾驶场景分布之后，在数据集上训练的深度学习模型将遭受“尾巴”的大量预测错误，这可能会导致计划者的失败。为此，这项工作定义了预测模型不确定性的概念，以量化由于数据稀疏而导致的高错误。此外，这项工作提出了一个轨迹规划师，以考虑对更安全性能的这种预测不确定性。首先，由于培训数据不足而导致的预测模型的不确定性是由集成网络结构估算的。然后，轨迹规划师的设计目的是考虑预测不确定性引起的最坏情况。结果表明，在数据不足引起的预测不确定性下，提出的方法可以提高轨迹计划的安全性。同时，使用足够的数据，该框架不会导致过度保守的结果。这项技术有助于在现实世界的长尾数据分布下提高自动驾驶汽车的安全性和可靠性。

translated by 谷歌翻译

A human factors approach to validating driver models for interaction-aware automated vehicles

Olger Siebinga , Arkady Zgonnikov , David Abbink

分类：机器人

2021-09-27

自动驾驶汽车的一个主要挑战是安全，平稳地与其他交通参与者进行互动。处理此类交通交互的一种有希望的方法是为自动驾驶汽车配备与感知的控制器（IACS）。这些控制器预测，周围人类驾驶员将如何根据驾驶员模型对自动驾驶汽车的行为做出响应。但是，很少验证IACS中使用的驱动程序模型的预测有效性，这可能会限制IACS在简单的模拟环境之外的交互功能。在本文中，我们认为，除了评估IAC的互动能力外，还应在自然的人类驾驶行为上验证其潜在的驱动器模型。我们为此验证提出了一个工作流程，其中包括基于方案的数据提取和基于人为因素文献的两阶段（战术/操作）评估程序。我们在一项案例研究中证明了该工作流程，该案例研究对现有IAC复制的基于反向的基于学习的驱动程序模型。该模型仅在40％的预测中显示出正确的战术行为。该模型的操作行为与观察到的人类行为不一致。案例研究表明，有原则的评估工作流程是有用和需要的。我们认为，我们的工作流将支持为将来的自动化车辆开发适当的驾驶员模型。

translated by 谷歌翻译

SafeBench: A Benchmarking Platform for Safety Evaluation of Autonomous Vehicles

Chejian Xu , Wenhao Ding , Weijie Lyu , Zuxin Liu , Shuai Wang , Yihan He , Hanjiang Hu , Ding Zhao , Bo Li

分类：机器人

2022-06-20

如最近的研究所示，支持机器智能的系统容易受到对抗性操纵或自然分配变化产生的测试案例的影响。这引起了人们对现实应用程序部署机器学习算法的极大关注，尤其是在自动驾驶（AD）等安全性领域中。另一方面，由于自然主义场景的传统广告测试需要数亿英里，这是由于现实世界中安全关键方案的高度和稀有性。结果，已经探索了几种自动驾驶评估方法，但是，这些方法通常是基于不同的仿真平台，安全性 - 关键的情况的类型，场景生成算法和驾驶路线变化的方法。因此，尽管在自动驾驶测试方面进行了大量努力，但在相似条件下，比较和了解不同测试场景产生算法和测试机制的有效性和效率仍然是一项挑战。在本文中，我们旨在提供第一个统一的平台Safebench，以整合不同类型的安全性测试方案，场景生成算法以及其他变体，例如驾驶路线和环境。同时，我们实施了4种基于深入学习的AD算法，具有4种类型的输入（例如，鸟类视图，相机，相机），以对SafeBench进行公平的比较。我们发现，我们的生成的测试场景确实更具挑战性，并观察到在良性和关键安全测试方案下的广告代理的性能之间的权衡。我们认为，我们的统一平台安全基地用于大规模和有效的自动驾驶测试，将激发新的测试场景生成和安全AD算法的开发。 SafeBench可从https://safebench.github.io获得。

translated by 谷歌翻译

B-GAP: Behavior-Rich Simulation and Navigation for Autonomous Driving

Angelos Mavrogiannis , Rohan Chandra , Dinesh Manocha

分类：机器人

2020-11-07

我们解决了由具有不同驱动程序行为的道路代理人填充的密集模拟交通环境中的自我车辆导航问题。由于其异构行为引起的代理人的不可预测性，这种环境中的导航是挑战。我们提出了一种新的仿真技术，包括丰富现有的交通模拟器，其具有与不同程度的侵略性程度相对应的行为丰富的轨迹。我们在驾驶员行为建模算法的帮助下生成这些轨迹。然后，我们使用丰富的模拟器培训深度加强学习（DRL）策略，包括一组高级车辆控制命令，并在测试时间使用此策略来执行密集流量的本地导航。我们的政策隐含地模拟了交通代理商之间的交互，并计算了自助式驾驶员机动，例如超速，超速，编织和突然道路变化的激进驾驶员演习的安全轨迹。我们增强的行为丰富的模拟器可用于生成由对应于不同驱动程序行为和流量密度的轨迹组成的数据集，我们的行为的导航方案可以与最先进的导航算法相结合。

translated by 谷歌翻译