See Message 7. In this kind of situation:

if, for example, either the 2445-15 or 2445-16 Text object is the "current" one, and object selection is "reaching out" around it looking for other nearby Text objects, if it reaches out far enough to find the bottom one in its own group, it will also find at least 3 if not all 4 of the group above. If it reaches out in small steps until it finds 3 additional objects [though not all groups have 4 -- another complication], at least one of them is sure to be in the group above, and it will miss at least the bottom one in its own group.
Filtering by color could limit, but not eliminate, those kinds of proximity selection errors.
Is there enough regularity in these to use their content as the basis for which ones do the reaching out? Only those that start with a 2? Would no others ever start that way? [I didn't investigate the entire drawing.] If that's a reliable criterion, the "reaching out" could extend downward only, rather than in all directions.
But then there are a few groups like this, which doesn't include the same kind of "header" Text as almost all others:

And should groups like these turn into left-aligned Mtext like most of the groups, or should the non-alignment remain somehow? [They're not right-justified, either.]

Really insurmountable:

I'm just having a hard time imagining a set of relationship and/or content criteria that could be counted on to find only the ones you really want to join together, and omit others, but maybe you can delineate something.
Kent Cooper, AIA