智能论文笔记

Automated Level Crossing System: A Computer Vision Based Approach with Raspberry Pi Microcontroller

Rafid Umayer Murshed , Sandip Kollol Dhruba , Md. Tawheedul Islam Bhuian , Mst. Rumi Akter

分类：计算机视觉

2022-12-08

In a rapidly flourishing country like Bangladesh, accidents in unmanned level crossings are increasing daily. This study presents a deep learning-based approach for automating level crossing junctions, ensuring maximum safety. Here, we develop a fully automated technique using computer vision on a microcontroller that will reduce and eliminate level-crossing deaths and accidents. A Raspberry Pi microcontroller detects impending trains using computer vision on live video, and the intersection is closed until the incoming train passes unimpeded. Live video activity recognition and object detection algorithms scan the junction 24/7. Self-regulating microcontrollers control the entire process. When persistent unauthorized activity is identified, authorities, such as police and fire brigade, are notified via automated messages and notifications. The microcontroller evaluates live rail-track data, and arrival and departure times to anticipate ETAs, train position, velocity, and track problems to avoid head-on collisions. This proposed scheme reduces level crossing accidents and fatalities at a lower cost than current market solutions. Index Terms: Deep Learning, Microcontroller, Object Detection, Railway Crossing, Raspberry Pi

translated by 谷歌翻译

Depth-based Sampling and Steering Constraints for Memoryless Local Planners

Thai Binh Nguyen , Linh Nguyen , Tanveer Choudhury , Kathleen Keogh , Manzur Murshed

分类：机器人

2022-11-06

By utilizing only depth information, the paper introduces a novel but efficient local planning approach that enhances not only computational efficiency but also planning performances for memoryless local planners. The sampling is first proposed to be based on the depth data which can identify and eliminate a specific type of in-collision trajectories in the sampled motion primitive library. More specifically, all the obscured primitives' endpoints are found through querying the depth values and excluded from the sampled set, which can significantly reduce the computational workload required in collision checking. On the other hand, we furthermore propose a steering mechanism also based on the depth information to effectively prevent an autonomous vehicle from getting stuck when facing a large convex obstacle, providing a higher level of autonomy for a planning system. Our steering technique is theoretically proved to be complete in scenarios of convex obstacles. To evaluate effectiveness of the proposed DEpth based both Sampling and Steering (DESS) methods, we implemented them in the synthetic environments where a quadrotor was simulated flying through a cluttered region with multiple size-different obstacles. The obtained results demonstrate that the proposed approach can considerably decrease computing time in local planners, where more trajectories can be evaluated while the best path with much lower cost can be found. More importantly, the success rates calculated by the fact that the robot successfully navigated to the destinations in different testing scenarios are always higher than 99.6% on average.

translated by 谷歌翻译

A Guide to Employ Hyperspectral Imaging for Assessing Wheat Quality at Different Stages of Supply Chain in Australia: A Review

Priyabrata Karmakar , Shyh Wei Teng. Manzur Murshed , Paul Pang , Cuong Van Bui

分类：计算机视觉 | 人工智能

2022-09-13

小麦是全球主要的主食之一。因此，必须衡量，维护和改善人类消费的小麦质量。传统的小麦质量测量方法主要具有侵入性，破坏性，并且仅限于小麦样本。在典型的小麦供应链中，有许多接收点，散装小麦到来，根据要求将其存储和转发。在此接受点，传统质量测量方法的应用非常困难，而且通常非常昂贵。因此，需要非侵入性，无损的实时方法来进行小麦质量评估。满足上述标准的一种这样的方法是用于食品质量测量的高光谱成像（HSI），也可以应用于批量样品。在本文中，我们研究了如何在文献中使用HSI来评估储存的小麦质量。因此，可以在单个紧凑的文档中提供所需的信息，以在澳大利亚供应链的不同阶段实施实时数字质量评估方法。

translated by 谷歌翻译

Efficient Motion Modelling with Variable-sized blocks from Hierarchical Cuboidal Partitioning

Priyabrata Karmakar , Manzur Murshed , Manoranjan Paul , David Taubman

分类：计算机视觉

2022-08-28

具有基于块体系结构的运动建模已被广泛用于视频编码中，其中框架分为固定尺寸的块，这些块是独立补偿的。这通常会导致编码效率低下，因为固定尺寸的块几乎与对象边界不符。尽管已经引入了层次结构分区来解决这一问题，但运动矢量的增加限制了收益。最近，与立方体分配的图像的近似分割已经普及。可变大小的矩形片段（立方体）不仅容易适应基于块的图像/视频编码技术，而且还可以很好地与对象边界保持一致。这是因为立方分区基于同质性约束，从而最大程度地减少了平方误差的总和（SSE）。在本文中，我们研究了针对可扩展视频编码中使用的固定尺寸块的运动模型的潜力。具体而言，我们使用图片组（GOP）中的锚框的立方分区信息构建了运动补偿帧。然后，预测的当前帧已用作基础层，同时使用可扩展的HEVC编码器编码当前帧作为增强层。实验结果确认4K视频序列上节省了6.71％-10.90％的比特率。

translated by 谷歌翻译

A new way of video compression via forward-referencing using deep learning

S. M. A. K. Rajin , M. Murshed , M. Paul , S. W. Teng , J. Ma

分类：计算机视觉

2022-08-13

为了利用同一场景的视频框架中的高时间相关性，使用基于块的运动估计和补偿技术从已经编码的参考帧中预测了当前帧。尽管这种方法可以有效利用移动对象的翻译运动，但它容易受到其他类型的仿射运动和对象遮挡/除含量的影响。最近，深度学习已被用来模拟人类姿势的高级结构，以从短视频中的特定动作中进行，然后通过使用生成的对抗网络（GAN）来预测姿势，从而在未来的时间内生成虚拟框架。因此，建模人姿势的高级结构能够通过预测人类的行为并确定其轨迹来利用语义相关性。视频监视应用程序将受益，因为可以通过估算人类姿势轨迹并通过语义相关性产生未来的框架来压缩存储的大监视数据。本文通过从已经编码的框架中对人姿势进行建模并在当前时间使用生成的框架来探讨一种新的视频编码方式。预计所提出的方法可以通过预测包含具有较低残差的移动对象的块来克服传统向后引用框架的局限性。实验结果表明，提出的方法平均可以实现高达2.83 dB PSNR增益和25.93 \％比特率的节省，用于高运动视频序列

translated by 谷歌翻译

How Much More Data Do I Need? Estimating Requirements for Downstream Tasks

Rafid Mahmood , James Lucas , David Acuna , Daiqing Li , Jonah Philion , Jose M. Alvarez , Zhiding Yu , Sanja Fidler , Marc T. Law

分类：计算机视觉 | 机器学习

2022-07-04

给定一个较小的培训数据集和学习算法，要达到目标验证或测试性能需要多少数据？这个问题至关重要，在诸如自动驾驶或医学成像之类的应用中，收集数据昂贵且耗时。高估或低估数据需求会带来大量费用，而预算可以避免。关于神经缩放定律的先前工作表明，幂律函数可以符合验证性能曲线并将其推断为较大的数据集大小。我们发现，这并不能立即转化为估计所需数据集大小以满足目标性能的更困难的下游任务。在这项工作中，我们考虑了一系列的计算机视觉任务，并系统地研究了一个概括功能功能的功能家族，以便更好地估算数据需求。最后，我们表明，结合调整的校正因子并在多个回合中收集会显着提高数据估计器的性能。使用我们的准则，从业人员可以准确估算机器学习系统的数据要求，以节省开发时间和数据采集成本。

translated by 谷歌翻译

Cryo-shift: Reducing domain shift in cryo-electron subtomograms with unsupervised domain adaptation and randomization

Hmrishav Bandyopadhyay , Zihao Deng , Leiting Ding , Sinuo Liu , Mostofa Rafid Uddin , Xiangrui Zeng , Sima Behpour , Min Xu

分类：计算机视觉 | 机器学习

2021-11-17

冷冻电子断层扫描（Cryo-et）是一种3D成像技术，可以在近原子分辨率下原位地置于亚细胞结构。细胞冷冻剂图像有助于解决大分子的结构并在单个细胞中确定它们的空间关系，这对细胞和结构生物学具有广泛的意义。体摩数分类和识别构成了这些大分子结构的系统恢复的主要步骤。已被证明监督深度学习方法对重组分类进行高度准确和高效，而是由于缺乏注释数据而受到有限的适用性。虽然生成用于训练监督模型的模拟数据是潜在的解决方案，但与真实实验数据相比，生成数据中的图像强度分布的相当差异将导致训练有素的模型在预测真实错误谱图上预测类别中的差。在这项工作中，我们呈现了低温，一个完全无监督的域适应和随机化框架，用于深入学习的跨域重组分类。我们使用无监督的多逆境域适应来减少模拟和实验数据的特征之间的域移位。我们使用“翘曲”模块开发网络驱动的域随机化过程，以改变模拟数据，并帮助分类器在实验数据上更好地推广。我们不使用任何标记的实验数据来训练我们的模型，而一些现有的替代方法需要标记为跨域分类的实验样本。然而，在本文在本文中，使用两种模拟和实验数据在本文中显示的广泛评估研究中的横域重组分类中现有的替代方法的优先效果优异。

translated by 谷歌翻译