Computer science

FitDepth: fast and lite 16-bit depth image compression algorithm

Abstract This article presents a fast parallel lossless technique and a lossy image compression technique for 16-bit single-channel images. Nowadays, such techniques are “a must” in robotics and other areas where several depth cameras are used. Since many of these algorithms need to run on low-profile hardware, such as embedded systems, they should be very fast and customizable. The proposal is ...
1 year ago

E2E-BPF microscope: Extended depth-of-field microscopy using learning-based implementation of binary phase filter and image deconvolution

Abstract Several image-based biomedical diagnoses require high-resolution imaging capabilities at large spatial scales. However, conventional microscopes exhibit an inherent trade-off between depth-of-field (DoF) and spatial resolution, and thus require objects to be refocused at each lateral location, which is time-consuming. Here, we present a computational imaging platform, termed E2E-BPF micro...
1 year ago

Improving Chest X-ray Report Generation by Leveraging Text of Similar Images

Abstract Automatic medical report generation is the production of grammatically correct and coherent reports from radiology images. The encoder-decoder is the most common architecture for report generation, but it has not achieved satisfactory performance because of the complexity of the task. This paper presents an approach to improve the performance of report generation that can be eas...
1 year ago

Robust Evidence C-Means Clustering Combining Spatial Information for Image Segmentation

Abstract Although evidence c-means clustering (ECM), which is based on evidence theory, overcomes the limitations of fuzzy theory to some extent and improves on the capability of fuzzy c-means clustering (FCM) to express and process the uncertainty of information, ECM does not consider the spatial information of pixels, which leaves it unable to deal effectively with noise pixels. Applying ECM directly...
1 year ago

Attention Inspiring Receptive-Fields Multi-Task Network via Self-supervised Learning for Violence Recognition

Abstract Generally, a large amount of training data is essential for training a deep learning model to achieve more accurate detection performance in the computer vision domain. However, collecting and annotating datasets is costly. In this letter, we propose a self-supervised auxiliary task to learn general video features without adding any human-annotated labels, aiming at improving th...
1 year ago

Image-Based Malware Detection Using α-Cuts and Binary Visualisation

Image conversion of malicious binaries, or binary visualisation, is a relevant approach in the security community. Recently, it has exceeded the role of a single-file malware analysis tool and has become a part of Intrusion Detection Systems (IDSs) thanks to the adoption of Convolutional Neural Networks (CNNs). However, there has been little effort toward image segmentation for the converted image...
1 year ago

An Optical Remote Sensing Image Matching Method Based on the Simple and Stable Feature Database

Satellite remote sensing has entered the era of big data due to the increase in the number of remote sensing satellites and imaging modes. This presents significant challenges for the processing of remote sensing systems and will result in extremely high real-time data processing requirements. The effective and reliable geometric positioning of remote sensing images is the foundation of remote sen...
1 year ago

Non-Linear Signal Processing Methods for UAV Detections from a Multi-Function X-Band Radar

This article develops the applicability of non-linear processing techniques such as Compressed Sensing (CS), Principal Component Analysis (PCA), Iterative Adaptive Approach (IAA), and Multiple-input-multiple-output (MIMO) for the purpose of enhanced UAV detections using portable radar systems. The combined scheme has many advantages and the potential for better detection and classification accurac...
1 year ago

Laser-Visible Face Image Translation and Recognition Based on CycleGAN and Spectral Normalization

The range-gated laser imaging instrument can capture face images in a dark environment, which provides a new idea for long-distance face recognition at night. However, the laser image has low contrast, low SNR and no color information, which affects observation and recognition. Therefore, it becomes important to convert laser images into visible images and then identify them. For image translation...
1 year ago

Flood-Related Multimedia Benchmark Evaluation: Challenges, Results and a Novel GNN Approach

This paper discusses the importance of detecting breaking events in real time to help emergency response workers, and how social media can be used to process large amounts of data quickly. Most event detection techniques have focused on either images or text, but combining the two can improve performance. The authors present lessons learned from the Flood-related multimedia task in MediaEval2020, ...
1 year ago
