* add a transposed version of the LQUP decomposition routine LUdivine
* fix many bugs in LUdivine
* new schedules for Winograd algorithm for matrix multiplication: 2 cases depending whether beta = 0 or not, taken form [Huss Ledermann & Al. 96]
* new tests for ftrtri, echelon and redechelon