So, the ask is to read an excel file from a GCS bucket and create a csv out of it. I have created a structure parser for this.
Now, in CDI, I have created a mapping.
-
The source connection is a google could storage v2.
in the source bucket, I have a flat file (x.txt) that has the link to the actual source excel file. The data is in the following format:
“filepath”
“gs://<bucket_name>/<file_name.xlsx> -
After the source, sits the structure parser. It has two inputs, the file path and the data. I am using the x.txt file to pass the gcs bucket path to the structure parser.
-
One of the groups from the structure parser is passed on to the target.
When I execute this map, I get the error, that the actual source excel file – “file_name.xlsx” can not be read.
The exact error message is as follows: “[ERROR] Cannot find the input file. For a flat file source transformation, verify that you used a reference file. Verify that all referenced file paths are valid. The following file was not found: [File name: gs://<bucket_name>/<file_name>.xlsx]”
Any idea how to resolve this? Thanks in advance.
Bored Panda is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.