Splitting Data in a Column

Splitting Data in a Column - sql

I get some data that comes in to the table.
This table is currently only displaying after I merged all the tables. So you currently see:
Table 1
union
Table 2
union
Table 3
The issue I have is now i have one column in there where it contains data like this:
AA2B133
I want to split this column, so in the current column it tries to keep the first 3 charachters and the other 4 charachters it goes into another column.
What is the best way or simplest way of doing this.
Thank you

You can use the RIGHT() and LEFT() functions to split the data into multiple columns.
For example:
SELECT LEFT(data,3), RIGHT(data,4)
FROM (SELECT 'AA2B133' AS data) A
Will return two columns with the breakout you requested.

You can use Java to get the 3 first caracters using this
String substring (String s, int start, int len)
so if you want to get the three first charachters for example you can do like this
String substring ("AA2B133",0,2)

Related

How to extract all (including int and float) numerical values in a string column in Google BigQuery?

I have a table Table_1 on Google BigQuery which includes a string column str_column. I would like to write a SQL query (compatible with Google BigQuery) to extract all numerical values in str_column and append them as new numerical columns to Table_1. For example, if str_column includes first measurement is 22 and the other is 2.5; I need to extract 22 and 2.5 and save them under new columns numerical_val_1 and numerical_val_2. The number of new numerical columns should ideally be equal to the maximum number of numerical values in str_column, but if that'd be too complex, extracting the first 2 numerical values in str_column (and therefore 2 new columns) would be fine too. Any ideas?

Consider below approach
select * from (
select str_column, offset + 1 as offset, num
from your_table, unnest(regexp_extract_all(str_column, r'\b([\d.]+)\b')) num with offset
)
pivot (min(num) as numerical_val for offset in (1,2,3))
if applied to sample data like in your question - output is

Extract the highest key:value pair from a string in Standard SQL

I have the following data type below, it is a type of key value pair such as 116=0.2875. Big Query has stored this as a string. What I am required to do is to extract the key i.e 116 from each row.
To make things more complicated if a row has more than one key value pair the iteration to be extracted is the one with the highest number on the right e.g {1=0.1,2=0.8} so the extracted number would be 2.
I am struggling to use SQL to perform this, Particularly as some rows have one value and some have multiple:
This is as close as I have managed to get where I can create a bit of code to extract the highest right hand value (which I don't need) but I just cant seem to create something to either get the whole key/value pair which would be fine and work for me or just the key which would be great.
column
,(SELECT MAX(CAST(Values AS NUMERIC)) FROM UNNEST(JSON_EXTRACT_ARRAY(REPLACE(REPLACE(REPLACE(column,"{","["),"}","]"),"=",","))) AS Values WHERE Values LIKE "%.%") AS Highest
from `table`
Here is some sample data:
1 {99=0.25}
2 {99=0.25}
3 {99=0.25}
4 {116=0.2875, 119=0.6, 87=0.5142857142857143}
5 {105=0.308724832214765}
6 {105=0.308724832214765}
7 {139=0.5712754555198284}
8 {127=0.5767967894928858}
9 {134=0.2530120481927711, 129=0.29696599825632086, 73=0.2662459427947186}
10 {80=0.21242613001118038}
Any help on this conundrum would be greatly appreciated!

Consider below approach
select column,
( select cast(split(kv, '=')[offset(0)] as int64)
from unnest(regexp_extract_all(column, r'(\d+=\d+.\d+)')) kv
order by cast(split(kv, '=')[offset(1)] as float64) desc
limit 1
) key
from your_table
if applied to sample data in your question - output is

How can I "dynamically" split a varchar column by specific characters?

I have a column that stores 2 values. Example below:
| Column 1 |
|some title1 =ExtractThis ; Source Title12 = ExtractThis2|
I want to remove 'ExtractThis' into one column and 'ExtractThis2' into another column. I've tried using a substring but it doesn't work as the data in column 1 is variable and therefore it doesn't always carve out my intended values. SQL below:
SELECT substring(d.Column1,13,24) FROM dbo.Table d
This returns 'Extract This' but for other columns it either takes too much or too little. Is there a function or combination of functions that will allow me to split consistently on the character? This is consistent in my column unlike my length count.

select substring(col1,CHARINDEX('=',col1)+1,CHARINDEX (';',col1)-CHARINDEX ('=',col1)-1) Val1,
substring(col1,CHARINDEX('=',col1,CHARINDEX (';',col1))+1,LEN(col1)) Val2
from #data
there is duplicate calculation that can be reduced from 5 to 3 to each line.
but I want to believe this simple optimization done by SQL SERVER.

Split a value with delimiter length 3 '|||'

I have a PLSQL code to fetch data from a table with 30 rows out of which 10 columns are with delimiter of length 3 and I need to convert 1 row into multiple rows based on number of fields in those 10 columns.
So I am loading all Data into 1 temp table , and on temp table I am calling a cursor which will split data and insert multiple rows into main table.
Inside cursor I am using regexp_substr to split the Value and regular expression used by me is [^\\|]+{3} , I am not getting actual Values after splitting.
Sample data used for test case is
100|||200||300|||400||||0
After splitting I should get values as below
100, 200||300 , 400 , |0
But what I am getting is
100 , 200, 300 ,400, 0
Can any one suggest me the proper way to do it?
Waiting for reply!
Thanks

Try this. Hope it helps.
SELECT REPLACE('100|||200||300|||400||||0','|||',',') OUTPUT FROM DUAL;
----------------------------OUTPUT---------------------------------------------
OUTPUT
100,200||300,400,|0
-----------------------------------------------------------------------------

Regex (\|*\d.*?)\|{3}|(\|*\d$) captures what you're after here.

How do I get the last three digits of a varying length column?

I want to write a select statement and get the last three digits of all of the rows in a column for which the length varies.
Any ideas on how I can achieve this?
Hypothetical column:
12312398098098
127865275
I want the resulting column to have the values:
Resulting column after the script:
098
275

SELECT RIGHT(CONVERT(VARCHAR(4000), [hypothetical column]), 3) FROM table;
(Added a convert in case this is a numeric column.)

We Keep Coding

sql objective-c vba vb.net react-native apache vue.js tensorflow api pandas

Splitting Data in a Column - sql

You can use the RIGHT() and LEFT() functions to split the data into multiple columns. For example: SELECT LEFT(data,3), RIGHT(data,4) FROM (SELECT 'AA2B133' AS data) A Will return two columns with the breakout you requested.

You can use Java to get the 3 first caracters using this String substring (String s, int start, int len) so if you want to get the three first charachters for example you can do like this String substring ("AA2B133",0,2)

Related

How to extract all (including int and float) numerical values in a string column in Google BigQuery?

Extract the highest key:value pair from a string in Standard SQL

How can I "dynamically" split a varchar column by specific characters?

Split a value with delimiter length 3 '|||'

How do I get the last three digits of a varying length column?

Categories

Resources