With a 14178107*8 vector, a 108GB memory machine is quickly used up. Is there a way to reduce the memory footprint? The train output: ``` Input: 14178107 points, dimension 8 scheduler = Parlay-HomeGrown num-threads = 16 num-cell = 12333095 compute-grid = 5.06638 ```