I understand that in many cases the point is to find a cheaper solution, but perhaps it's better to focus on what will actually work.

The best solution I have found is stacking Macs: I feed the output of one into the input of the machine running Catalyst. This has proven to work best.

Something not yet noted is the significance of delay. Each device in the chain adds frame delay, and cheaper or lower-quality devices tend to add more.
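
To put rough numbers on that, here is a minimal Python sketch of how per-device delay adds up. The device names and frame counts are made-up placeholders, not measurements of any real hardware, and the function name is my own; it just assumes each device in series adds a whole number of frames.

```python
def chain_delay_ms(frame_delays, fps):
    """Total delay in milliseconds for devices connected in series.

    frame_delays: frames of delay added by each device (assumed values).
    fps: frame rate of the video signal.
    """
    frame_time_ms = 1000.0 / fps  # duration of one frame
    return sum(frame_delays) * frame_time_ms


# Hypothetical chain; the frame counts are illustrative only.
example_chain = {
    "source Mac output": 1,
    "converter": 2,
    "capture input on the Catalyst machine": 1,
}

total_frames = sum(example_chain.values())
total_ms = chain_delay_ms(example_chain.values(), fps=60)
print(f"{total_frames} frames = {total_ms:.1f} ms at 60 fps")
```

Swapping in frame counts you have actually measured for your own devices would show how quickly the delay grows with each box added to the chain.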