I am a newbie to Pentaho.
Is there any reason why I am getting this error as I try to write CSV content to MySQL:
"Unknown column 'Latitude' in field list"
I am using the dataset from the link below:
http://openflights.org/data.html
I have 3 steps:
- "csv file input" which previews fine
- "select values" which also previews fine
- a "table output" where the proble lies.
regards
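For context, this MySQL error usually means the Table Output step is writing a field (here, Latitude) that does not exist in the target table, so the table's column names have to match the fields coming out of "select values". A minimal sketch of such a table, with hypothetical column names based on the OpenFlights airports data:

CREATE TABLE airports (
  Name VARCHAR(255),
  City VARCHAR(255),
  Country VARCHAR(255),
  Latitude DOUBLE,
  Longitude DOUBLE
);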
I tried to upload this data sample to BigQuery manually (after trying to upload from Google Cloud, I extracted some rows to find the problem). I ran into these errors:
Error while reading data, error message: CSV table encountered too many errors, giving up. Rows: 1; errors: 1. Please look into the errors[] collection for more details.
Error while reading data, error message: CSV table references column position 77, but line starting at position:0 contains only 56 columns.
My sample data is: https://drive.google.com/file/d/1v8jcIKSY7HiOpdc40BFJXACvgX8prWm0/view?usp=sharing
Please use the following steps to resolve the issue:
Download the file from Google Drive
Open the file and save it as "CSV UTF-8 (Comma delimited) (*.csv)"
Open BigQuery and upload the CSV file with "Auto detect" schema
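For reference, once the cleaned CSV is in a Cloud Storage bucket, the same load can also be done with a BigQuery LOAD DATA statement; this is only a sketch with hypothetical dataset, table, and bucket names:

LOAD DATA INTO mydataset.mytable
FROM FILES (
  format = 'CSV',
  skip_leading_rows = 1,
  uris = ['gs://my-bucket/sample.csv']
);

If mydataset.mytable does not exist yet, LOAD DATA should create it, auto-detecting the schema from the file.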
So I've got about 10 JSON files that I have to stuff into an Elasticsearch setup. I have 3 steps currently: "Get file names", "JSON Input", and "Elasticsearch bulk insert". When I look at the step metrics, I see that "Get file names" is correctly reading the 10 files, but when it comes to the JSON Input, only the first file is processed. What could be going on?
Here is an image of my setup, and I've attached the ktr file.
Link to the ktr file as it stands currently
Any help is greatly appreciated.
In the Content tab of the step, the "Limit" attribute is set to 1. You can edit this by unchecking the "Source is from a previous step" option in the File tab; then set "Limit" to 0.
I've been writing SQL in environments where the databases and tables are all easy to pull in using simple 'FROM db.table'. Now I'm trying to do my own project on .csv files. I want to be able to write all of my queries in .sql files and execute them using command line.
I'm uncertain about the following:
What the best program to use is.
How to execute a .sql file from command line.
How to import a .csv file.
Yipes! I'm new to using command line and I'm new to pulling in my own tables.
I'm currently trying out SQLite3, but from the documentation* it doesn't look like I can simply execute a .sql file using SQLite3 from the command line.
I've tried running "sqlite3 HelloWorld.sql" in command line for a file that just has "SELECT 'Hello World';" in it and this is what I get:
SQLite version 3.9.2 2015-11-02 18:31:45
Enter ".help" for usage hints.
sqlite>
Any advice would be greatly appreciated!
* https://www.sqlite.org/cli.html
On Windows you can execute SQL (files) via the command line:
>sqlite3 "" "SELECT 'Hello World!';"
Hello World!
>sqlite3 "" ".read HelloWorld.sql"
Hello World!
This won't create a database file because the first parameter, which would normally name the database file, is empty ("").
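As for importing a .csv file (the third question above), the sqlite3 shell has built-in dot-commands for that; a minimal sketch with hypothetical file and table names:

>sqlite3 mydata.db
sqlite> .mode csv
sqlite> .import data.csv mytable
sqlite> SELECT COUNT(*) FROM mytable;

Here .mode csv tells the shell to parse the file as comma-separated values, and .import loads it into mytable, creating the table from the file's header row if it does not already exist.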
Hi, I am new to HP Exstream. I am trying to create output in CSV instead of PDF.
I could not find any indication in the documentation. Please help.
HP Exstream is used to generate output in AFP, PDF, HTML, PCL, etc. It doesn't support generating CSV as an output/output queue.
But, under "Data file" of an "Application", a report file can be generated. This report file can be a delimited file, and you can add any delimiter you want.
I am using the template to do a copy. In the "S3" data node, I have configured a Custom format because my column data contains commas.
The custom data format I am using has the following properties:
Column separator is PIPE ("|") and Record separator is "\n"
When I run the pipeline, the data does not get loaded into S3 and I get the following error:
Error copying record\nCause: java.lang.reflect.InvocationTargetException\nCause: null\nCause: Quote character must be defined in record format
Has anyone faced this problem?
The "Escape Char" option is not shown under the "Custom" format type in AWS Data Pipeline.
You can use "TSV" type as your custom format type and provide:
"Column separator" as pipe(|),
"Record separator" as new line(\n),
"Escape Char" as backslash(\) or any other character you want.
If you are using Aurora as the source database, I suggest you use the SELECT INTO OUTFILE S3 command. Otherwise, build your own using EMR (the Hive and Sqoop projects will help you with this).
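For reference, a minimal sketch of the Aurora MySQL export using the same pipe delimiter; the table name and S3 URI are hypothetical, and the exact URI form (and the IAM role Aurora needs for S3 access) depends on your setup:

SELECT * FROM mytable
INTO OUTFILE S3 's3://my-bucket/export/mytable'
FIELDS TERMINATED BY '|'
LINES TERMINATED BY '\n';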