Relative Content

Tag Archive for apache-sparkhadooppyspark

There is an error while trying to load multiple files with Spark pypark

staticDataFrame = spark.read.format(“csv”) .option(“header”, “true”) .option(“inferSchema”, “true”) .load(“data/retail-data/by-day/*.csv”) In the above code, theres is this error: Exception in thread “globPath-ForkJoinPool-1-worker-1” java.lang.UnsatisfiedLinkError: ‘boolean org.apache.hadoop.io.nativeio.NativeIO$Windows.access0(java.lang.String, int)’ If i load only one file, there is no error. apache-spark hadoop pyspark