问题描述
我正在尝试对将函数应用于数组的不同方法进行基准测试.
I'm trying to benchmark different ways to apply a function to an array.
为什么是 https://software.intel.com/sites/landingpage/IntrinsicsGuide/#expand=3260,2124,4779,4779&cats=Trigonometry&text=_sin
_mm_sin_ps
我的范围不知道,但 _mm_sqrt_ps
是?
_mm_sin_ps
not known to my scope but _mm_sqrt_ps
is?
我如何让它为人所知?并且编译没有错误.
how do I make it known? And compile it without errors.
#include <random>
#include <iostream>
#include <cmath>
#include <chrono>
#include <algorithm>
#include <valarray>
#include "immintrin.h"
#include <array>
int main()
{
std::cout<<"start
";
std::random_device rd;
std::mt19937 gen(rd());
std::uniform_real_distribution<> dis(-1000, 1000);
int N=100;
while(N--)
{
std::cout<<"
N: "<<N;
const int T1=4E6;
{
int T=T1,T0=T1/4;
std::array<float,T1> array;
while(T--)
{
array[T]=dis(gen);
}
auto start_time = std::chrono::high_resolution_clock::now();
auto it =array.begin();
while(T0--)
{
__m128 X = _mm_loadu_ps(it);
__m128 result = _mm_sin_ps(X);
_mm_storeu_ps(it, result);
it+=4;
}
auto time2=std::chrono::high_resolution_clock::now()-start_time;
std::cout<<"
intr1: "<<std::chrono::duration_cast<std::chrono::microseconds>(time2).count();
}
}
std::cout<<"
fin
";
return 0;
}
编译器
g++ -v
Using built-in specs.
COLLECT_GCC=g++
COLLECT_LTO_WRAPPER=/usr/lib/gcc/x86_64-linux-gnu/4.8/lto-wrapper
Target: x86_64-linux-gnu
Configured with: ../src/configure -v --with-pkgversion='Ubuntu 4.8.2-19ubuntu1' --with-bugurl=file:///usr/share/doc/gcc-4.8/README.Bugs --enable-languages=c,c++,java,go,d,fortran,objc,obj-c++ --prefix=/usr --program-suffix=-4.8 --enable-shared --enable-linker-build-id --libexecdir=/usr/lib --without-included-gettext --enable-threads=posix --with-gxx-include-dir=/usr/include/c++/4.8 --libdir=/usr/lib --enable-nls --with-sysroot=/ --enable-clocale=gnu --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-gnu-unique-object --disable-libmudflap --enable- plugin --with-system-zlib --disable-browser-plugin --enable-java-awt=gtk --enable-gtk-cairo --with-java-home=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64/jre --enable-java-home --with-jvm-root-dir=/usr/lib/jvm/java-1.5.0-gcj-4.8-amd64 --with-jvm-jar-dir=/usr/lib/jvm-exports/java-1.5.0-gcj-4.8-amd64 --with-arch-directory=amd64 --with-ecj-jar=/usr/share/java/eclipse-ecj.jar --enable-objc-gc --enable-multiarch --disable-werror --with-arch-32=i686 --with-abi=m64 --with-multilib-list=m32,m64,mx32 --with-tune=generic --enable-checking=release --build=x86_64-linux-gnu --host=x86_64-linux-gnu --target=x86_64-linux-gnu
Thread model: posix
gcc version 4.8.2 (Ubuntu 4.8.2-19ubuntu1)
推荐答案
_mm_sin_ps
是 SVML 库,仅随英特尔编译器提供.GCC 开发人员专注于包装机器指令和简单的任务,因此目前 immintrin.h
中没有 SVML.
_mm_sin_ps
is part of the SVML library, shipped with intel compilers only. GCC developers focused on wrapping machine instructions and simple tasks, so there's no SVML in immintrin.h
so far.
您必须使用库或自己编写.窦道实施:
You have to use a library or write it by yourself. Sinus implementation:
- 泰勒系列
- CORDIC
- 二次曲线
这篇关于C++ 错误:“_mm_sin_ps"未在此范围内声明的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持跟版网!