Eigen-unsupported 5.0.1-dev+7c7d8473
#include <unsupported/Eigen/CXX11/src/Tensor/TensorExecutor.h>
Process all the data with a single CPU thread, one block of data at a time. Sizing each block to fit in the L1 cache improves cache performance.
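The idea of block-based evaluation can be illustrated with a small standalone sketch. This is not Eigen's implementation and uses no Eigen types: the names `kAssumedL1Bytes`, `blockElems`, and `evalBlocked` are hypothetical, and the 32 KiB L1 budget is an assumption that varies between CPUs. The sketch evaluates `a * b + a` in two stages per block, so the intermediate product stays in a small scratch buffer that is likely to remain cache-resident.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical L1 data-cache budget in bytes (assumption; hardware-dependent).
constexpr std::size_t kAssumedL1Bytes = 32 * 1024;

// Number of floats per block such that `num_buffers` blocks of that size
// together fit in `l1_bytes`. Returns at least 1.
constexpr std::size_t blockElems(std::size_t l1_bytes, std::size_t num_buffers) {
  return (l1_bytes / (num_buffers * sizeof(float))) > 0
             ? l1_bytes / (num_buffers * sizeof(float))
             : 1;
}

// Evaluates out[i] = a[i] * b[i] + a[i] on a single thread, block by block.
// Illustrative only: a real executor would dispatch on the expression type.
std::vector<float> evalBlocked(const std::vector<float>& a,
                               const std::vector<float>& b,
                               std::size_t block) {
  assert(a.size() == b.size());
  assert(block > 0);

  std::vector<float> out(a.size());
  std::vector<float> scratch(block);  // reused for every block

  for (std::size_t start = 0; start < a.size(); start += block) {
    // The last block may be shorter than `block`.
    const std::size_t len = std::min(block, a.size() - start);

    // Stage 1: write the intermediate product into the scratch block.
    for (std::size_t i = 0; i < len; ++i) {
      scratch[i] = a[start + i] * b[start + i];
    }
    // Stage 2: consume the scratch block while it is still small and hot.
    for (std::size_t i = 0; i < len; ++i) {
      out[start + i] = scratch[i] + a[start + i];
    }
  }
  return out;
}
```

A caller might choose the block size as `blockElems(kAssumedL1Bytes, 3)`, budgeting for two input blocks and one scratch block. The result is the same for any positive block size; only the memory access pattern changes.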