I want to remove all of the following characters from a field: * - : .
Currently I can do this by chaining TRANSLATE statements together, but I hope there's an easier way.
INPUT
SELECT field FROM myTable
| field |
=========
| asdf* |
| as.df |
| a-sdf |
| :asdf |
DESIRED OUTPUT
| field |
=========
| asdf |
| asdf |
| asdf |
| asdf |
MY CODE
SELECT TRANSLATE(TRANSLATE(TRANSLATE(TRANSLATE(field, '*', ''), '-', ''), ':', ''), '.', '')
translate() takes longer strings:
select translate(field, '*-:.', '')
You might be confusing it with replace(), where you would need to nest the functional calls.
Related
How can I check whether a certain substring (for instance 18UT) is part of a string in a column?
Redshifts' SUBSTRING function allows me to "cut" a certain substring based on a starting index + length of the subtring, but not check whether a specific substring exists is in the column's value.
Example:
+------------------+
| col |
+------------------+
| 14TH, 14KL, 18AB |
| 14LK, 18UT, 15AK |
| 14AB, 08ZT, 18ZH |
| 14GD, 52HG, 18UT |
+------------------+
Desired result:
+------------------+------+
| col | 18UT |
+------------------+------+
| 14TH, 14KL, 18AB | No |
| 14LK, 18UT, 15AK | Yes |
| 14AB, 08ZT, 18ZH | No |
| 14GD, 52HG, 18UT | Yes |
+------------------+------+
Here is one option:
select col,
case when ', ' || col || ', ' like '%, 18UT, %' then 'yes' else 'no' end has_18ut
from mytable
While this will solve your immediate, problem, it should be note that storing delimited lists in a database table is bad practice, and should be avoided. Each value should go to a separate row instead.
I want to remove the ".com" suffix from the domain name field which is an array of strings separated by a pipe(|)
The data looks like below-
ID domain
1 ab23c45 | xyz167a.com | d1ef76om.com |rx08bj23
2 omg23hy5 | xyz167a | ab23c45.com | jhy2ft3.com
The result should look like-
ID domain
1 ab23c45 | xyz167a | d1ef76om |rx08bj23
2 omg23hy5 | xyz167a | ab23c45 | jhy2ft3
Below is for BigQuery Standard SQL
#standardSQL
SELECT id,
(
SELECT STRING_AGG(TRIM(REPLACE(TRIM(domain), IFNULL(NET.PUBLIC_SUFFIX(TRIM(domain)), ''), ''), '.'), ' | ')
FROM UNNEST(SPLIT(domain, '|')) domain
) domain
FROM `project.dataset.table`
if to apply to sample data from your question - output is
Row id domain
1 1 ab23c45 | xyz167a | d1ef76om | rx08bj23
2 2 omg23hy5 | xyz167a | ab23c45 | jhy2ft3
Note: above code handles any suffix - not just '.com'
I have a table column like below:
| cloumn_a |
| ------------------ |
| Alpha_Black_1 |
| Alpha_Black_2323 |
| Alpha_Red_100 |
| Alpha_Blue_2344 |
| Alpha_Orange_33333 |
| Alpha_White_2 |
| |
Usually, when I want to split with any symbol or character I am using the split_part(text, text, integer) so split_part(column_a, '_', 1)
I need to remove the numeric part of each variable and keep only the text part like Alpha_Black.
I cannot use the trim function because the numeric part can change
How can I skip the first underscore and split from the second one?
I would suggest using REGEXP_REPLACE here:
SELECT
column_a,
REGEXP_REPLACE(column_a, '_\\d+$', '') AS column_a_out
FROM yourTable;
Demo
I am a beginner in SQL language and I am using postgre sql and doing little exercices to learn. I have a column of strings named acronym from a destination table:
DO1
ES1
ES2
FR1
FR10
FR2
FR3
FR4
FR5
FR6
FR7
FR8
FR9
GP1
GP2
IN1
IN2
MU1
RU1
TR1
UA1
I would like to add a padding zero for acronym numbers that have only one digit, output:
DO01
ES01
ES02
FR01
FR02
FR03
FR04
FR05
FR06
FR07
FR08
FR09
FR10
GP01
GP02
IN01
IN02
MU01
RU01
TR01
UA01
How can I get to the left of the first number in the string? There is some regex I think but I did not figure it out
You can use the rpad() function to add characters to the end of the value:
select rpad(col, '0', 4)
In your case, though, you want a value in-between. On simple method is -- assuming that the first two characters are strings -- is:
(case when length(col) = 3
then left(col, 2) || '0' || right(col, 1)
else col
end)
Another possibility is using regexp_replace():
regexp_replace(col, '^([^0-9]{2})([0-9])$', '\10\2')
Both of these assume that the strings to be padded are three characters, which is consistent with your data. It is unclear what you want for other lengths.
try with below:
to_char() function
select to_char(column1, 'fm000') as column2
from Test_table;
fm "fill mode"prefix avoids leading spaces in the resulting var char.
000 it defines the number of digits you want to have.
You can use string functions like lpad(), substr(), left():
select
concat(left(columnname, 2), lpad(substr(columnname, 3), 2, '0')) result
from tablename
See the demo.
Results:
| result |
| ------ |
| DO01 |
| ES01 |
| ES02 |
| FR01 |
| FR10 |
| FR02 |
| FR03 |
| FR04 |
| FR05 |
| FR06 |
| FR07 |
| FR08 |
| FR09 |
| GP01 |
| GP02 |
| IN01 |
| IN02 |
| MU01 |
| RU01 |
| TR01 |
| UA01 |
I have a column with data as abc.123, def.345 and so on. It is basically a name followed by a . and then a code. I am able to divide them using
select
LEFT(Campaign, ISNULL(NULLIF(CHARINDEX('.', Campaign + ' ') -1, -1), LEN(Campaign))),
STUFF(Campaign, 1, Len(Campaign) +1- CHARINDEX('.',Reverse(Campaign)), '')
from myTable;
Input set:
| myColumn |
| abc.123 |
| def.345 |
| 444 |
However, for data like '444' it shows:
|Name| Code |
|abc | 123 |
|def | 345 |
|444 | | (SHOULD BE => | | 444 |)
Assumptions:
Data can be without a '.' In such case we can insert the data in either Name or Code depending on datatype ie Name=>Alphanumeric, Code=>Numeric.
eg: Data like
999 => | | 999 |
a24.345 => | a24 | 345 |
a72 => | a72 | |
Use CASE expressions together with the LIKE operand in order to differentiate the cases:
SELECT
CASE
WHEN Campaign LIKE '[a-z]%' THEN LEFT(Campaign, CHARINDEX('.', Campaign + '.') - 1)
ELSE null
END AS Name,
CASE
WHEN Campaign LIKE '[0-9]%' THEN Campaign
WHEN Campaign LIKE '%.[0-9]%' THEN
RIGHT(Campaign, LEN(Campaign) - CHARINDEX('.', Campaign))
ELSE null
END AS Code
FROM myTable
You can see an example here: http://sqlfiddle.com/#!3/ae7ef7/1
Use a case statement:
select (case when myColumn like '%.%'
then <your code here>
else myColumn
end) as code