RaSPDAM (Radio Single-Pulse Detection Algorithm Based on Visual Morphological Features) is a novel machine learning algorithm specifically designed for the detection of Fast Radio Bursts (FRBs). Developed to address the computational and time challenges associated with traditional FRB search methods, RaSPDAM offers significant improvements in both efficiency and accuracy.
Fast Radio Bursts (FRBs) are brief, intense pulses of radio energy originating from distant galaxies. Their discovery and study are crucial for understanding the distribution and evolution of matter in the universe. However, the detection of FRBs is a challenging task due to the vast amounts of data produced by radio telescopes and the computational complexity of existing search methods. Traditional techniques often struggle with detecting weak signals and are time-consuming.
The preprocessing pipeline begins by converting the original signal sequence into standardized images without de-dispersion, where the sequence is divided into smaller slices using a 2-second sliding window with a 1-second overlap. These image slices are then resized to 512 × 512 pixels to standardize the model input. To enhance signal characteristics, particularly the typical curve of the FRB signal amidst noise, convolution is applied using multiple kernels generated from fixed curve slopes, transforming the image by processing each pixel and its local neighbors. After this, morphological dilation further enhances the signal. Finally, the enhanced images are combined with the original image to form an RGB image, which serves as the model input, as illustrated in Figure 1.
RaSPDAM utilizes a U-Net architecture, a convolutional neural network (CNN) designed for image segmentation tasks. This U-Net model comprises an encoder that extracts slope-based features from the input image and a decoder that reconstructs the image using these features. Despite this process, some noise may still remain in the output, so we apply additional filtering steps to identify potential FRB signals. Using the "regionprops" function, we analyze connected regions in the segmented image and calculate each candidate's projections on the x- and y-axes. Candidates exceeding defined thresholds on both axes are flagged as potential FRB signals.
For more details, please refer to the paper.
The testing of RaSPDAM is facilitated by the FAST-FREX dataset, which is built upon observations obtained by the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The dataset consists of:
- 600 positive samples: Observed FRB signals from three sources (FRB20121102, FRB20180301, and FRB20201124).
- 1000 negative samples: Noise and Radio Frequency Interference (RFI).
- High Precision: RaSPDAM achieves a precision of 98.73%, significantly outperforming traditional methods like PRESTO and Heimdall.
- High Recall: With a recall rate of 77.67%, RaSPDAM effectively identifies a large proportion of true FRB signals.
- F1 Score: An F1 score of 0.8694 indicates a well-balanced trade-off between precision and recall.
- ToA and DM: While RaSPDAM currently provides Time of Arrival (ToA) as a result, future enhancements aim to include Dispersion Measure (DM) for more comprehensive signal verification.
Comparison with Traditional Methods
Software | TN | TP | FN | FP | Recall | Precision | F1 Score |
---|---|---|---|---|---|---|---|
PRESTO | 3 | 472 | 0 | 26963700 | 0.7867 | 1.7505E-05 | 3.5009E-05 |
Heimdall | 218 | 489 | 36 | 5854 | 0.8150 | 0.0771 | 0.1409 |
RaSPDAM | 989 | 466 | 128 | 6 | 0.7767 | 0.9873 | 0.8694 |
Since its deployment, RaSPDAM has been instrumental in identifying:
- 2 new FRBs: FRB20211103A and FRB20230104.
- 80 pulsars: Including 13 previously undiscovered pulsars, highlighting the algorithm's efficacy in uncovering new celestial objects.
pip install -r requirements.txt
python raspdam.py <fits_file> [-m <model_path>] [-o <output_path>] [-sigmoid_threshold <value>] [-box_fill_threshold <value>] [-box_projection_threshold <value>]
- fits_file: The path to the FITS file to be processed (required).
- -m, -model_path: The path to the model file (optional, default: models).
- -o, -output_path: The path to save the output file (optional, default: current directory).
- -sigmoid_threshold: Threshold value for the sigmoid activation function (optional, default: 0.5).
- -box_fill_threshold: Minimum percentage threshold for the area filled within each bounding box (optional, default: 0.25).
- -box_projection_threshold: Minimum projection percentage threshold for bounding box filtering (optional, default: 0.15).