最新刊期

卷 40 ，期 6 ， 2025

Material Physics

Multi-peak emission gold clusters for optoelectronic application and temperature-dependent fluorescence properties 封面论文封底论文增强出版 AI导读

“金纳米团簇荧光特性研究取得新进展，通过配体工程策略提升发光性能，为电致发光器件应用提供新思路。”

CHEN Zhenda, DAI Haitao, ZHANG Xiaodong, HAO Xichen, DENG Jiachen, CAO Yaxian, Arain SAMIA, Ahmad ANEELA, LIU Changlong, LI Minxuan, FENG Shouzhong
Vol. 40, Issue 6, Pages: 809-817(2025) DOI: 10.37188/CJLCD.2025-0068

摘要：Au nanoclusters， as a quasi-zero-dimensional semiconductor fluorescent material with excellent luminescence and photostability， have been widely used in the field of bioimaging. The material also has a wide range of application prospects in electroluminescent devices as well as displays. This study proposes a novel ligand engineering strategy employing thermally activated delayed fluorescence （TADF） mechanisms to significantly improve the photoluminescent performance of Au nanoclusters （Au NCs）. The proposed methodology focuses on the efficient harvesting of triplet excitons， which effectively inhibits non-radiative transition pathways. Ultra-small core-shell structured TPA-DCPP/TOAB/AuCs nanoclusters （CS-AuCs） with a size of （1.97±0.31） nm which exhibits thermally activated delayed fluorescence （TADF） were successfully synthesized via ligand engineering. Subsequently， the electronic energy level structure， exciton dynamics， and transition mechanisms were analyzed through the absorption and emission spectra of CS-AuCs via a combined analysis of temperature-dependent absorption/emission spectra and transient photoluminescence （TRPL） measurements. The results show that the short-wavelength fluorescence peaks （507 nm and 528 nm） of CS-AuCs exhibit TADF characteristics， i.e.， their luminescence intensity is enhanced with increasing temperature. The CS-AuCs thin film demonstrated a maximum fluorescence peak intensity enhancement of 5.10-fold@80 ℃. The long-wavelength emission peak（797 nm）， on the other hand， exhibits the temperature-dependent property of normal fluorescence peaks， i.e.， the luminescence intensity decreases with increasing temperature. Meanwhile， the film-forming experiments demonstrated that CS-AuCs has excellent compatibility and film-forming properties while maintaining the fluorescence characteristics， which enhances the potential of Au nanoclusters for applications in display devices such as cluster light emitting diodes （CLED）.

关键词：Au nanocluster;thermally activated delayed fluorescence;core-shell structure;exciton dynamics;temperature-dependent photoluminescence

44

|

9

|

0

<HTML>
<L-PDF><WORD><Meta-XML>

<引用本文> <批量引用> 100635001 false

更新时间：2025-06-05

Device Physics and Device Preparation

Linearly polarized light emission of AlGaInP-based red light Micro-LED with subwavelength metal grating 封面论文封底论文增强出版 AI导读

“Micro-LED显示技术因其高亮度、高分辨率和高对比度等优势，被认为最有前景的下一代显示技术。本研究在AlGaInP基红光Micro-LED表面集成亚波长金属光栅，通过调整优化光栅参数和介质层厚度，实现了高消光比的线偏振光输出，为3D Micro-LED显示领域提供解决方案。”

WANG Yawei, ZHENG Kaifeng, LÜ Jinguang, GUO Guangtong, MA Siyao, CHEN Yupeng, ZHAO Baixuan, ZHAO Yingze, NIE Haitao, QIN Yuxin, LI Bin, WANG Weibiao, LIANG Jingqiu
Vol. 40, Issue 6, Pages: 818-826(2025) DOI: 10.37188/CJLCD.2025-0041

摘要：Micro-LED display technology is regarded as the most promising next-generation display technology due to its advantages such as high brightness， high resolution， and high contrast. The development and application of technologies such as 3D display， medical detection， information encryption， visible light communication， efficient liquid crystal backlight display， polarized microscopy， and sensing have introduced new requirements for Micro-LEDs to achieve high extinction ratio polarized light output. To realize the design of a miniaturized red-light Micro-LED device capable of emitting linearly polarized light， this study integrates subwavelength metal gratings on the surface of AlGaInP-based red-light Micro-LEDs and employs the finite-difference time-domain （FDTD） method to conduct theoretical analysis and structural optimization of the metal gratings. The results demonstrate that by optimizing the grating parameters and the thickness of the dielectric layer， a red-light Micro-LED with a chip size of 50 μm achieves an extinction ratio of 26 dB， enabling high extinction ratio linearly polarized light output. The AlGaInP-based linearly polarized Micro-LED designed in this study exhibits significant potential for applications in the field of 3D Micro-LED displays.

关键词：light-emitting diodes;polarization;metal gratings;FDTD method

68

|

34

|

0

<HTML>
<L-PDF><WORD><Meta-XML>

<引用本文> <批量引用> 98368302 false

更新时间：2025-06-05
In-plane polarization beam splitter based on patterned liquid crystal alignment 封面论文封底论文增强出版 AI导读

“在光子系统领域，研究者构建了基于液晶折射率梯度分布的可编程光学器件，实现了激光光束的偏振可控分束，为新型可重构光子器件开发提供理论依据与技术路径。”

FANG Xinyu, ZHAO Jiahui, YU Xiao, JIA Qilin, TIAN Jingqi, WANG Ziye, YANG Jidan, LI Bingxiang
Vol. 40, Issue 6, Pages: 827-835(2025) DOI: 10.37188/CJLCD.2025-0064

摘要：Traditional optical beam splitters are limited by their fixed geometric structures and static working modes， making it challenging to meet the demands of dynamically reconfigurable photonic systems. To achieve polarization-controllable laser beam splitting， this study developed a programmable optical device based on liquid crystal gradient refractive index distribution. By establishing an equivalent refractive index model for liquid crystal planar structures， triangular-patterned liquid crystal cells with alignment angles of 30°， 45°， 60°， and 90° were fabricated using digital mask lithography via a digital micromirror device （DMD） chip， systematically constructing the mapping relationship between alignment angles and effective refractive indices. Experimental results revealed that as the alignment angle increased from 30° to 90°， the corresponding effective refractive index exhibited a monotonically increasing trend. Under horizontally linear polarized light incidence， the refraction angle of extraordinary light （e-light） showed a linear growth with increasing incident angle （15°~75°）， confirming the applicability of Snell's law in liquid crystal media. During continuous polarization state modulation from horizontal to vertical orientations， the intensity ratio between e-light and ordinary light （o-light） demonstrated regular evolution， reaching a 1∶1 balanced state at a 46° polarization angle. Circularly polarized light excitation generated symmetrical dual beams， revealing the regulatory mechanism of birefringence phase difference on beam splitting behavior. This work realizes polarization-encoded beam splitting through gradient refractive index design in patterned liquid crystal devices， with its dynamically tunable characteristics providing both theoretical foundation and technical pathways for developing novel reconfigurable photonic devices.

关键词：liquid crystal photoalignment;digital mask lithography;polarization beam splitting;effective refractive index;Snell's law

3

|

0

|

0

<HTML>
<L-PDF><WORD><Meta-XML>

<引用本文> <批量引用> 109508011 false

更新时间：2025-06-05
Preparation and photo-magnetic actuation properties of bilayer composite liquid crystal elastomers 封面论文封底论文增强出版 AI导读

“在柔性机器人领域，研究人员采用界面聚合策略，制备出具有光化学-光热-磁三重刺激响应的双层纳米复合液晶弹性体，为软体机器人智能驱动提供新策略。”

CAI Yujie, CONG Huaiping
Vol. 40, Issue 6, Pages: 836-845(2025) DOI: 10.37188/CJLCD.2025-0063

摘要：In order to endow liquid crystal elastomers （LCE） with multiple-stimuli-responsiveness and enhance their potential application in flexible robotics， we developed an interfacial polymerization strategy to fabricate a bilayer nanocomposite LCE with photochemical， photothermal， and magnetic responsiveness. The Fe₃O₄ nanoparticles coated with octa-vinyl polyhedral oligomeric silsesquioxane （POSS@Fe₃O₄ NPs） were sprayed onto the surface of partially polymerized azobenzene LCE， and then induced the free radical polymerization between the unreacted acrylic-ester groups in the polymeric matrix and the abundant vinyl on the magnetic nanomaterials layer， to yield a POSS@Fe₃O₄ NPs-LCE composite （PFe-LCEs） with a tightly-integrated organic-inorganic interface. With merits of the UV-induced isomerization of azobenzene mesogens and the photothermal conversion capability of POSS@ Fe₃O₄ NPs， the composite elastomer could generate a 200° bending in 2 s under UV irradiation and returned to its initial state within 7.5 s under infrared light exposure. In contrast， utilizing visible light irradiation for 50 s can only restore 83% of the original shape. Additionally， this material exhibits magnetic responsiveness， allowing it to deflect under an applied magnetic field. The PFe-LCE was used to fabricate a quadrupedal transport robot capable of grasping， transporting， and releasing cargo through light-magnetic synergistic actuation in a complex environment. The demonstrated environmental adaptability highlighted a novel strategy for intelligent actuation in soft robotics.

关键词：liquid crystal elastomers;multiple-stimulus-responsiveness;interfacial polymerization;bilayer structure;soft robots

80

|

30

|

0

<HTML>
<L-PDF><WORD><Meta-XML>

<引用本文> <批量引用> 89764695 false

更新时间：2025-06-05
Fabrication of photomask based on patterned guest-host liquid crystal 封面论文封底论文增强出版 AI导读

“微纳结构制造领域取得新突破，专家基于图案化取向宾主液晶设计掩模板，实现微米级结构精准制造，为低成本微结构加工提供新思路。”

WANG Junhui, LIU Xuan, DUAN Xia, LI Boyuan, YUAN Conglong, ZHENG Zhigang
Vol. 40, Issue 6, Pages: 846-853(2025) DOI: 10.37188/CJLCD.2025-0050

摘要：Micro-nano structures play a crucial role in numerous fields， particularly in optics， sensing， and display technologies. Although various techniques have been developed for micro-nano structure fabrication， achieving these structures through simple and low-cost approaches remains a significant challenge. To address this issue， this paper proposes a design based on patterned oriented guest-host liquid crystals， which successfully realizes the preparation of mask templates constructed on demand. Specifically， the patterned mask is successfully established through the combination of the anisotropic absorption characteristics of guest-host liquid crystals to polarized light and the photo-alignment technology， achieving precise spatial intensity regulation of the incident light field. By introducing it into the laser direct writing system， micro-nano structures with various geometric patterns were successfully formed on the surface of the photoresist. The structure is in close agreement with the predefined mask， demonstrating the capability of structural manufacturing at the micron level （feature size 5 μm）. In addition， the self-assembly of liquid crystal on the micro-nano structure enable period grating， thereby achieving stable diffraction under short-wavelength conditions. This result further demonstrates that the designed liquid crystal mask can achieve precise control over the intensity of incident light by accurately modulating the spatial distribution of the optical field. This study provides a novel and cost-effective approach for microstructure fabrication， offering new opportunities for the development of large-area， rapid manufacturing techniques for complex micro-nano structures.

关键词：Guest-host liquid crystal;photomask;photoalignment;micro-nano structures

73

|

24

|

0

<HTML>
<L-PDF><WORD><Meta-XML>

<引用本文> <批量引用> 88742689 false

更新时间：2025-06-05
Research progress of damage to liquid crystal display modules in humid-heat and cold environment 封面论文封底论文增强出版 AI导读

“液晶显示技术在军事、汽车、医疗等领域应用广泛，但在湿热寒冷环境下易受损。专家综述了损伤情况，探讨了优化措施，为未来发展提供思路。”

ZHOU Rui, NING Honglong, WEI Xiaoqin, ZHOU Jianhua, LIU Yuxiang, ZHANG Kangping, LI Muyun, WANG Lei, YAO Rihui, PENG Junbiao
Vol. 40, Issue 6, Pages: 854-866(2025) DOI: 10.37188/CJLCD.2025-0025

摘要：Liquid Crystal Display （LCD） is a technology that uses liquid crystal materials to display images. It has advantages such as high durability， reliability， low power consumption， high resolution， and low radiation， and is widely used in military， automotive， medical equipment and other fields. In actual application environments， due to the differences in application regions and fields， LCD modules are often in wet and hot or cold environments. In humid-heat and cold environments， LCD modules will encounter problems such as image distortion， internal structure corrosion， “Mura” defects， and deformation of liquid crystal molecules. All these may cause irreversible damage to the modules and lead to the screen being unable to function properly. This paper provides a comprehensive review of damage conditions in LCD modules under humid-heat and cold environments. It introduces the fundamental components and working principles of liquid crystal displays， discusses the optimization measures implemented under different environmental conditions， and analyzes the potential issues and challenges in future development. The study concludes with perspectives on technological advancements and application prospects in this field.

关键词：liquid crystal display;humid-heat and cold environment;module damage

286

|

42

|

0

<HTML>
<L-PDF><WORD><Meta-XML>

<引用本文> <批量引用> 85785981 false

更新时间：2025-06-05

Image Processing

3D hand reconstruction method based on adaptive occlusion recovery and topology-pose bidirectional perception 封面论文封底论文增强出版 AI导读

“在三维手部网格重建领域，专家提出了一种实时高效的两阶段网络框架，有效解决了遮挡和高灵活度手部姿态下的重建难题，为抗遮挡、实时且高效、姿态准确的三维手部重建提供了解决方案。”

LIU Jia, HUANG Nanxuan, CHEN Dapeng, WEI Lina
Vol. 40, Issue 6, Pages: 867-880(2025) DOI: 10.37188/CJLCD.2024-0345

摘要：Existing 3D hand mesh reconstruction methods face multiple challenges， especially when dealing with occlusion and highly flexible hand poses， which lead to issues such as missing geometric information and topological errors. To enable accurate and efficient 3D hand reconstruction under occlusion， this paper proposes a two-stage network framework for real-time and efficient reconstruction of hand 3D meshes from monocular RGB images. In the first stage， the adaptive occlusion recovery module is designed by introducing learnable attention weight masks and region consistency loss within the attention mechanism. This module targets occluded regions and adaptively recovers information， significantly enhancing feature representation ability under occlusion. In the second stage， the paper combines static topology modeling and dynamic pose perception， as well as feature information exchange between bidirectional graph convolutions and a novel joint rotation-aware attention. This results in the topology-pose bidirectional perception module， which achieves complementary enhancement of static and dynamic features， improving the ability to capture fine-grained joint details. The proposed method is evaluated through qualitative and quantitative experiments on the FreiHAND and InterHand2.6M datasets， compared with state-of-the-art methods. Experimental results show that on the FreiHAND dataset， the proposed method achieves the PA-MPVPE reduction to 6.1 mm with an inference speed of 39 FPS， and on the InterHand2.6M dataset， the MPJPE of the proposed method is reduced to 8.07 mm， and the MPVPE is reduced to 8.22 mm. The proposed approach meets the requirements for robust occlusion handling， real-time performance， and accurate pose estimation in 3D hand reconstruction.

关键词：3D hand mesh reconstruction;attention mechanism;graph convolutional network;regional consistency loss;deep learning

42

|

22

|

0

<HTML>
<L-PDF><WORD><Meta-XML>

<引用本文> <批量引用> 82340472 false

更新时间：2025-06-05
Reconstruction method of video snapshot compressive imaging based on Mamba-2 封面论文封底论文增强出版 AI导读

“在视频快照压缩成像领域，M2BA-SCI算法实现了高质量与快速重建的平衡，为视频SCI重建提供了新思路。”

SHI Dunpan, XU Wei, PIAO Yongjie, FANG Yinghong, JI Haolin, LI Pengfei
Vol. 40, Issue 6, Pages: 881-894(2025) DOI: 10.37188/CJLCD.2024-0356

摘要：Video snapshot compressive imaging （SCI） is a novel imaging technique. It captures three-dimensional video data using a two-dimensional detector within a single exposure time and then reconstructs the original video data with appropriate algorithms. Although many current algorithms have outstanding performance in the reconstruction tasks of video SCI， the improvement of their reconstruction quality often comes at the cost of sacrificing the reconstruction speed， which significantly reduces the real-time performance of the algorithms. To balance reconstruction quality and speed， this paper proposes an end-to-end deep video SCI reconstruction method based on Mamba-2， namely M2BA-SCI. The M2BA-SCI network consists of a preprocessing module， a token generation block， Mamba attention blocks， and a video reconstruction block. Among them， the Mamba attention blocks are mainly composed of Mamba-2 linear attention blocks and feed-forward neural networks. A large number of experiments on simulated and real video datasets show that M2BA-SCI achieves a more balanced effect compared with previous algorithms. It maintains a relatively fast reconstruction speed while improving the reconstruction quality. In the benchmark grayscale video dataset， the average PSNR is 34.85， the average SSIM is 0.966， and the running time is 0.23 s. In the benchmark color video dataset， the average PSNR is 36.21， the average SSIM is 0.963， and the running time is 1.03 s. M2BA-SCI brings new ideas to video SCI reconstruction and provides a reference for designing algorithms with higher reconstruction quality based on the Mamba model.

关键词：video snapshot compressive imaging;compressive sensing;mamba-2;deep learning

61

|

43

|

0

<HTML>
<L-PDF><WORD><Meta-XML>

<引用本文> <批量引用> 82341228 false

更新时间：2025-06-05
Robust stitching method for borehole wall images under multi-interference imaging conditions 封面论文封底论文增强出版 AI导读

“在地质勘探领域，专家提出了一种钻孔内壁图像鲁棒拼接方法，有效解决了成像干扰和信息缺失问题，提升了拼接质量。”

LUO Qi, XU Wangming, LI Yaoxiang, ZHAO Yuhang
Vol. 40, Issue 6, Pages: 895-904(2025) DOI: 10.37188/CJLCD.2025-0018

摘要：Aiming at the problem that the stitching of borehole wall images in geological exploration is easily affected by multiple factors such as imaging illumination conditions， probe disturbance， and information missing of depth or azimuth， a robust stitching method for borehole wall images under multi-interference imaging conditions is proposed. Firstly， an image mapping method with conditional constraints is adopted to expand a specific sampling region of the borehole wall image into a narrowband image that can be effectively used for stitching under the interference of the imaging light source. Then， the intrinsic characteristics of the images in borehole video are utilized， and an ROI-based registration strategy is adopted to realize the accurate registration of the unfolded narrowband borehole images under the conditions of probe disturbance and information missing of depth and azimuth， and then a precise registration of the unfolded narrowband image of the borehole under the conditions of probe disturbance， missing depth or azimuthal angle. At the same time， a grid-based matching strategy is applied to improve the registration speed. Finally， a row-by-row dynamically weighted image fusion method is adopted to eliminate the stitching traces according to the registration results， which further optimizes the quality of the stitched images. Through the experiments on the simulated images under different geological conditions， the performance indexes of the proposed method with average value of SSIM of 71.39%， average value of PSNR of 26.74， and average value of registration accuracy of 92.38% are better than those of the comparison method. Through the experimental results on the real images of different borehole videos， the objective evaluation indexes of the stitching results obtained by the proposed method， such as EN， SF， AG， MI， and Q_MI， etc.， are all improved.

关键词：image stitching;image registration;image fusion;borehole wall image

5

|

0

|

0

<HTML>
<L-PDF><WORD><Meta-XML>

<引用本文> <批量引用> 109507701 false

更新时间：2025-06-05
Arbitrary shape text detection method based on Fourier centerline prediction 封面论文封底论文增强出版 AI导读

“文本检测技术取得新突破，利用傅里叶变换构建文本特征，预测文本中心线重建复杂形状文本框，提升任意形状文本检测性能。”

BAI Kun, WANG Zhe, MA Long, XUE Yao, LI Guodong, YAN Tian, WANG Xiaotian
Vol. 40, Issue 6, Pages: 905-914(2025) DOI: 10.37188/CJLCD.2025-0033

摘要：Text detection techniques have become highly mature， but detecting text with arbitrary shapes remains a major challenge in text detection tasks due to geometric encoding limitations of text boxes. In natural scenes， text exhibits a diverse range of shapes， and real-world text is subject to influences such as shooting angles， physical distortions of background objects， and the inherent curvature of the text itself. Merely using rectangular bounding boxes proves insufficient for encompassing irregular text instances. To improve the issue of detecting text with arbitrary shapes， a method that utilizes Fourier transforms in the frequency domain to construct text features is proposed， enabling the reconstruction of predicted boxes by predicting the text’s central line. The predicted text central line not only assists in reconstructing text boxes with complex shapes， but also aids in the subsequent text recognition process through central line correction. The proposed method achieves quite competitive performance on challenging arbitrary-shaped text detection dataset CTW1500， TotalText.

关键词：scene text detection;deep neural network;arbitrary text shape

36

|

28

|

0

<HTML>
<L-PDF><WORD><Meta-XML>

<引用本文> <批量引用> 85786030 false

更新时间：2025-06-05
Small object detection algorithm in UAV aerial images based on improved YOLO11 封面论文封底论文增强出版 AI导读

“在无人机航拍图像小目标检测领域，ACFI-YOLO11算法通过轻量化设计和跨层特征交互，显著提升了检测性能，为解决小目标检测问题提供了高效实用方案。”

ZHANG Zhihao, LI Xiaorun, CHEN Shuhan
Vol. 40, Issue 6, Pages: 915-930(2025) DOI: 10.37188/CJLCD.2025-0010

摘要：Small object detection in UAV aerial images faces challenges such as small object sizes， complex backgrounds， and limited computational resources. Most existing object detection models deployed on UAVs suffer from low accuracy and struggle to achieve a good balance between detection accuracy and efficiency. To address these issues， this paper proposes a lightweight small object detection algorithm， ACFI-YOLO11 （Attention-based Cross-layer Feature Interaction-YOLO11）， based on YOLO11 framework. First， this paper designs a Tiny Head branch， which enhances the model’s ability to detect tiny objects by introducing higher-resolution feature maps. Second， this paper proposes a novel attention-based cross-layer feature interaction module （ACFI）. This module uses a layer feature aggregation （LFA） mechanism and a Transformer encoder to enable direct information exchange between the current layer and its adjacent layers. This approach addresses the limitations of the original model’s neck network， where features were passed sequentially and primarily focused on the preceding layer， failing to fully explore and leverage cross-layer feature correlations. The proposed module significantly enhances the model’s representational capacity. Finally， this paper introduces space-to-depth （SPD） convolution to replace traditional convolution. This reduces the model’s parameters and computational cost while preserving critical spatial information during downsampling， thereby improving detection accuracy for small objects. Experimental results on the VisDrone2021 dataset show that compared to YOLO11s， ACFI-YOLO11 achieves improvements of 4.2%， 3.5%， 5.2%， and 4.0% in AP_S， AP_XS， mAP₅₀， and mAP_50-95， respectively， and outperforms other comparison algorithms with a mAP_50-95 of 31.7%. Furthermore， comparative experiments on the UAVDT dataset validate the superiority of ACFI-YOLO11， achieving a mAP_50-95 of 83.3%， significantly outperforming other state-of-the-art algorithms. These results demonstrate that ACFI-YOLO11 not only achieves a lightweight model design but also significantly enhances detection performance， providing an efficient and practical solution for small object detection in drone aerial imagery.

关键词：small object detection;unmanned aerial vehicle;YOLO11;feature fusion;Transformer

362

|

66

|

0

<HTML>
<L-PDF><WORD><Meta-XML>

<引用本文> <批量引用> 82341264 false

更新时间：2025-06-05
Multi-scale pedestrian detection algorithm based on joint head and overall information 封面论文封底论文增强出版 AI导读

“在密集场景行人检测领域，研究者改进Faster R-CNN算法，提出基于头部和整体信息联合的多尺度行人检测算法，有效提升了检测准确性。”

MA Ximing, LI Ning, WU Di, WANG Jianfei, YU Xiangyue
Vol. 40, Issue 6, Pages: 931-941(2025) DOI: 10.37188/CJLCD.2024-0352

摘要：In dense scenes， occlusion of the pedestrian body and varying pedestrian scales are the main reasons for the degradation of pedestrian detection accuracy. Since the head region of pedestrians is less occluded， it can be used to assist detection. In this regard， this paper improves the Faster R-CNN algorithm and proposes a multi-scale pedestrian detection algorithm based on the joint head and overall information. First， a recursive multi-scale feature extraction network incorporating the coordinate attention mechanism is designed for obtaining rich and detailed multi-scale feature information and enhancing the sensitivity of the network to small-scale pedestrian locations. Then， the region proposal network is utilized to simultaneously generate pedestrian head and body proposals， and a head and body detection branch is constructed for joint detection. Finally， an adaptive joint non-maximum suppression algorithm is proposed so that the detection boxes with severe overlap are not over-suppressed and the false detection boxes generated by the two detection branches are screened out simultaneously to further enhance the accuracy of pedestrian detection. The experimental results show that compared with the baseline algorithm， the proposed algorithm improves the average precision by 2.9% on the CrowdHuman dataset and reduces the log-average miss detection rate by 4%， and reduces the log-average miss detection rate by 2.4% and 2.2% on the two small-scale subsets of the TJU-DHD-pedestrian dataset， which verifies the effectiveness of the proposed algorithm in the dense scene.

关键词：pedestrian detection;joint detection;multi-scale feature fusion;attention mechanism;post-processing optimization

45

|

23

|

0

<HTML>
<L-PDF><WORD><Meta-XML>

<引用本文> <批量引用> 82340687 false

更新时间：2025-06-05
Research progress on absolute visual localization of unmanned aerial vehicles 封面论文封底论文增强出版 AI导读

“无人机技术在绝对视觉定位领域取得突破，深度学习提升定位精度与鲁棒性，为无人机自主导航提供新方向。”

MIAO Zongcheng, ZHOU Runxi, LIU Jie, YANG Haiyan
Vol. 40, Issue 6, Pages: 942-955(2025) DOI: 10.37188/CJLCD.2024-0350

摘要：In recent years， the application fields of UAV technology have been expanding， covering a wide range of fields such as military and civilian. Especially in the field of absolute visual localization， the introduction of deep learning has brought significant breakthroughs to traditional methods and promoted the technological development in this field. Compared with the early methods relying on traditional computer vision feature extraction， absolute visual localization methods based on deep learning have achieved significant improvements in localization accuracy and robustness. In this paper， we systematically review deep learning-based absolute visual localization methods for UAVs， and explore the application potential and development prospects of absolute visual localization technology by analyzing the limitations of global navigation satellite systems and relative visual localization in complex environments. The key research progress in this field is summarized， including dataset construction and deep learning model optimization， and the current technical bottlenecks are dissected. Finally， future research directions are proposed to further promote the wide application and sustainable development of absolute visual localization technology in the field of autonomous UAV navigation.

关键词：UAV;deep learning;localization;absolute visual localization

160

|

48

|

0

<HTML>
<L-PDF><WORD><Meta-XML>

<引用本文> <批量引用> 82340659 false

更新时间：2025-06-05

Address：No.3888 Dong Nanhu Road, Changchun, Jilin, China 130033 Postal code：130033
Tel：0431-86176059 Email：yjxs@ciomp.ac.cn
Technical support is provided by Beijing Founder electronics co., LTD 吉ICP备11002662号-17 京公网安备11010802024621
It is recommended to read the content of this site in Chrome&IE9+. Please switch to extreme mode in browser 360.
Cookies We use cookies to help provide and enhance our service and tailor content. By continuing, you agree to the use of cookies.

⁰

最新刊期

Multi-peak emission gold clusters for optoelectronic application and temperature-dependent fluorescence properties 封面论文 封底论文 增强出版 AI导读

Linearly polarized light emission of AlGaInP-based red light Micro-LED with subwavelength metal grating 封面论文 封底论文 增强出版 AI导读

In-plane polarization beam splitter based on patterned liquid crystal alignment 封面论文 封底论文 增强出版 AI导读

Preparation and photo-magnetic actuation properties of bilayer composite liquid crystal elastomers 封面论文 封底论文 增强出版 AI导读

Fabrication of photomask based on patterned guest-host liquid crystal 封面论文 封底论文 增强出版 AI导读

Research progress of damage to liquid crystal display modules in humid-heat and cold environment 封面论文 封底论文 增强出版 AI导读

3D hand reconstruction method based on adaptive occlusion recovery and topology-pose bidirectional perception 封面论文 封底论文 增强出版 AI导读

Reconstruction method of video snapshot compressive imaging based on Mamba-2 封面论文 封底论文 增强出版 AI导读

Robust stitching method for borehole wall images under multi-interference imaging conditions 封面论文 封底论文 增强出版 AI导读

Arbitrary shape text detection method based on Fourier centerline prediction 封面论文 封底论文 增强出版 AI导读

Small object detection algorithm in UAV aerial images based on improved YOLO11 封面论文 封底论文 增强出版 AI导读

Multi-scale pedestrian detection algorithm based on joint head and overall information 封面论文 封底论文 增强出版 AI导读

Research progress on absolute visual localization of unmanned aerial vehicles 封面论文 封底论文 增强出版 AI导读

Multi-peak emission gold clusters for optoelectronic application and temperature-dependent fluorescence properties 封面论文封底论文增强出版 AI导读

Linearly polarized light emission of AlGaInP-based red light Micro-LED with subwavelength metal grating 封面论文封底论文增强出版 AI导读

In-plane polarization beam splitter based on patterned liquid crystal alignment 封面论文封底论文增强出版 AI导读

Preparation and photo-magnetic actuation properties of bilayer composite liquid crystal elastomers 封面论文封底论文增强出版 AI导读

Fabrication of photomask based on patterned guest-host liquid crystal 封面论文封底论文增强出版 AI导读

Research progress of damage to liquid crystal display modules in humid-heat and cold environment 封面论文封底论文增强出版 AI导读

3D hand reconstruction method based on adaptive occlusion recovery and topology-pose bidirectional perception 封面论文封底论文增强出版 AI导读

Reconstruction method of video snapshot compressive imaging based on Mamba-2 封面论文封底论文增强出版 AI导读

Robust stitching method for borehole wall images under multi-interference imaging conditions 封面论文封底论文增强出版 AI导读

Arbitrary shape text detection method based on Fourier centerline prediction 封面论文封底论文增强出版 AI导读

Small object detection algorithm in UAV aerial images based on improved YOLO11 封面论文封底论文增强出版 AI导读

Multi-scale pedestrian detection algorithm based on joint head and overall information 封面论文封底论文增强出版 AI导读

Research progress on absolute visual localization of unmanned aerial vehicles 封面论文封底论文增强出版 AI导读