Performs a fuzzy merge of two TCR data.frames: for each row in
left_df, finds rows in right_df within radius
TCRdist units. Similar to a SQL JOIN but using distance instead of
exact matching.
Arguments
- left_df
Data.frame with TCR columns (
va,vb,cdr3a,cdr3b) plus any extra columns.- right_df
Data.frame with TCR columns (
va,vb,cdr3a,cdr3b) plus any extra columns.- organism
Character string (
"human"or"mouse").- radius
Numeric. Maximum TCRdist for a match.
- max_n
Integer. Maximum number of matches per left row. Default
5L. Closest matches are kept.- type
Character string. Join type:
"inner"(default) keeps only matched pairs,"left"keeps all left rows (NAs for unmatched).- suffix
Character vector of length 2. Suffixes for disambiguating column names. Default
c("_x", "_y").- rect_dist_matrix
Optional precomputed rectangular distance matrix (rows = left, cols = right). If provided,
organismis not used for distance computation.