SQL Plus show first four and last 4 numbers in the output - sql

How can I select a columns first 4 digits and last 4 digits and use an "X" placement for all the numbers in between?
Example
SELECT id from users where user_name ='Tom';
Output
5958694850384567
I am trying to get only the first and last 4 numbers with x's as placements to any number that is being masked:
Trying to get it to look like
Output:5958XXXXXXXX4567
Here is my query so far:
SELECT SUBSTR(id, 1, 4) from users
where user_name ='Tom'
Thank you for your time!

Have you considered simply use the LEFT() and RIGHT() functions? These will give you a specific number of characters from the left or right of a given string respectively.
You can also combine those to build your complete string using the CONCAT() function:
SELECT CONCAT(LEFT(id, 4), 'XXXXXXXX', RIGHT(id, 4))
FROM users
WHERE user_name = 'Tom'
Additionally, if you don't always have a given number of characters within the string, you could calculate the middle section of your output as well via the REPLICATE() function and a bit of math:
SELECT CONCAT(LEFT(id, 4), REPLICATE('X', LEN(id) - 8), RIGHT(id, 4))
FROM users
WHERE user_name = 'Tom'
Oracle Version
I didn't realize that you were using Oracle specifically and assumed SQL Server, so I'll provide some similar code to handle this in that flavor:
SELECT LEFT(id, 4) || RPAD('X', LEN(id) - 8, 'X') || RIGHT(id, 4)
FROM users
WHERE user_name = 'Tom'

Related

Select substring with dashes in SQL

I have this column called package_type and it contains strings like this:
TP-CYS01-01-2700-W-003
TP-CYS01-01-2700-W-004
TP-CYS01-02-2700-W-003
TP-CYS01-02-2700-W-001
I need to count the package_type but grouped by the package_no which is CYS01-01, CYS01-02.
What I have done is this:
select
substring(substring(package_type, '-', 3), '-', -2) as package_no,
count(distinct package_type)
from
project_june
where
progress = bill_of_quantity and event_date = '2020-06-12'
group by
substring(substring(package_type, '-', 3), '-', -2) as package_no
I get this error:
Invalid input syntax for integer: "-"
I expect the result would be like:
package_no count
------------------
CYS01-01 2
CYS01-02 2
How am I suppose to write this query?
Thank you.
The error message indicates that you are running Postgres. That database has powerful string functions - you can just use split_part():
select split_part(package_no, '-', 2) as package_no, count(distinct package_type) as cnt
from project_june
where progress = bill_of_quantity and event_date = date '2020-06-12'
group by split_part(package_no, '-', 2)
Notes:
Presumably, event_date is of a date-like datatype, so it should be compared against a literal date rather than a string.
I am suspicious about condition progress = bill_of_quantity; is bill_of_quantity an actual column in your table? If it is meant to be a literal string instead, then it should be surrounded with single quotes
Your strings seem to have a fixed format -- and you want 8 characters starting at the 4th position. That suggests that you can use:
select substring(package_type, 4, 8), as package_no,
count(distinct package_type)
from project_june
where progress = bill_of_quantity and
event_date = '2020-06-12'
group by substring(package_type, 4, 8);
There are no doubt other ways to write such a query. However, string functions are often database-specific and your question doesn't specify what database you are using.

Date is not displaying correct with substr & like query

I am trying to get this out out,
but i am experiencing that the substr i am using is incorrect ,
For an example , all my columns are displaying
hdfs://asdasda/asdas/fdsfdsfd/received_files/asdasd_20191231_11122333_123456789_CO.dat
some of which has more character so in order for me to get the exact date in the column is inconsistent if i am using subsring
some will return 20191230
but some will return _2020123
How do we tackle this problem ?
i am trying to display only data , this is using sql language or hue ,
when i input my script in ,
select SUBSTR(input_file_name, LENGTH(input_file_name) - 44, 9) from th_ingestion_status limit 100
i feel my script for Like and substr statement is incorrect
I you want the first sequence of 8 digits surrounded by underscores, use regexp_extract():
select regexp_extract(filename, '_([0-9]{8})_', 1)
If you need this after the last /, then:
select regexp_extract(filename, '_([0-9]{8})_[^/]*$', 1)
Please use below query, also please mention the database you are using, so that can provide relevant query
substr(column_name, instr(column_name, '_', 1, 2) +1, 6)
Oracle Test Case:
select 'hdfs://asdasda/asdas/fdsfdsfd/received_files/asdasd_20191231_11122333_123456789_CO.dat', substr('hdfs://asdasda/asdas/fdsfdsfd/received_files/asdasd_20191231_11122333_123456789_CO.dat', instr('hdfs://asdasda/asdas/fdsfdsfd/received_files/asdasd_20191231_11122333_123456789_CO.dat', '_', 1, 2) +1, 6)
from dual;

ORA-01722: invalid number - value with two decimals

I'm trying to get the max value from a text field. All but two of the values are numbers with a single decimal. However, two of the values have something like 8.2.10. How can I pull back just the integer value? The values can go higher than 9.n, so I need to convert this field into a number so that I can get the largest value returned. So all I want to get back is the 8 from the 8.2.1.
Select cast(VERSION as int) is bombing out because of those two values with a second . in them.
You may derive by using regexp_substr with \d pattern :
with tab as
(
select regexp_substr('8.2.1', '\d', 1, 1) from dual
union all
select regexp_substr('9.0.1', '\d', 1, 1) from dual
)
select * from tab;
For Oracle you must attend the value as string for retire only the part before the dot. Ex:
SELECT NVL( SUBSTR('8.2.1',0, INSTR('8.2.1','.')-1),'8.2.1') AS SR FROM DUAL;
Check than the value is repeated 3 times in the sentence, and if the value is zero or the value didn't have decimal part then it will return the value as was set.
I had to use T-SQL rather PL/SQL, but the idea is the same:
DECLARE #s VARCHAR(10);
SELECT #s='8.2.1';
SELECT CAST(LEFT(#s, CHARINDEX('.', #s) - 1) AS INT);
returns the integer 8 - note that it won't work if there are no dots because it takes the part of the string to the left of the first dot.
If my quick look at equivalent functions was correct, then in Oracle that would end up as:
SELECT CAST(SUBSTR(VERSION, 1, INSTR(VERSION, '.') - 1) AS INT)

How to use AVG() function after converting a substring to a numeric format?

In my example there are two columns like: "ID" (NUMBER) and "NUMBER_COUNT" (VARCHAR2) which currently look like:
ID: 1234567 -> NUMBER_COUNT = ':123:999:100:500:502:'
ID: 1111111 -> NUMBER_COUNT = ':444:543:100:664:404:'
I want to get the AVG() of NUMBER_COUNT grouped by ID, therefore the output should look like:
ID numbers_avg
1234567 444.8
1111111 431
I already tried this:
select
e.ID,
AVG(to_number(substr(e.NUMBER_COUNT, instr(e.NUMBER_COUNT, ':', 1, level) + 1,
instr(e.NUMBER_COUNT, ':', 1, level + 1) - instr(e.NUMBER_COUNT, ':', 1,
level) - 1),'99999999D99999')) as numbers_avg
from TABLE e
connect by level <= length(e.NUMBER_COUNT) - length(replace(e.NUMBER_COUNT, ':')) - 1
group by e.ID;
The statement can be executed. Unfortunately ORACLE neither displays any result, nor error.
Can someone provide help on this issue?
First of all, you shouldn't be storing the data this way. Normalizing the structure will be the best solution.
For now, you can use regexp_substr and also, fix you connect by to not create loops using sys_guid().
select
id,
avg(regexp_substr(number_count, '\d+', 1, level))
from your_table
connect by level <= regexp_count(number_count, ':')
and prior id = id
and prior sys_guid() is not null
group by id;
Demo
I have the sense that you were somehow trying to explode the colon separated string in the NUMBER_COUNT column, and then aggregate the numbers using GROUP BY. I think the easiest approach here might be to just use SUBSTR() and take the average of the 5 numbers, casting each number as needed.
SELECT ID, (CAST(SUBSTR(NUMBER_COUNT, 2, 3) AS NUMBER) +
CAST(SUBSTR(NUMBER_COUNT, 6, 3) AS NUMBER) +
CAST(SUBSTR(NUMBER_COUNT, 10, 3) AS NUMBER) +
CAST(SUBSTR(NUMBER_COUNT, 14, 3) AS NUMBER) +
CAST(SUBSTR(NUMBER_COUNT, 18, 3) AS NUMBER)) / 5 AS numbers_avg
FROM yourTable
Note that I would expect this approach to be faster than using REGEXP_SUBSTR(), which was suggested by others. I don't see the need for regex here, because Oracle's base string functions have the problem covered.
Demo here:
Rextester

Max of a part of split string

I have, in my DB oracle 10g, a field that contains references.
It's stored as : name/yyyy/mm/number
The new number, is the max number found in the part mm/number.
So, for now, I have a split of my string that gives me a list of str_array like this :
str_array(name, yyyy, mm, number)
I'd like, with this, found max number, for the couple mm/number.
Is this possible to do this?
Can I have something like :
SELECT MAX(split(reference, '/').lastPartOfArray) into nb
FROM table
where lastPartOfArray-1 = sysdate.month;
Data samples :
Smith/2013/12/1
Smith/2013/11/1
Smith/2013/12/3
Jones/2013/12/6
Smith/2013/12/3
Jones/2013/11/7
Since we are in the month 12, a max on those data must give me 6 into nb.
The number part, has no limit, it can be 1000, 10000...
The part Jones/2013 doesn't really matter for the number. But I can't have the same number, for a month.
My apologies, I don't know if this is possible, so I tried to write what I want in the query.
Is this possible, or should I create more than one field in my table(name/yyyy, mm, number)?
edit : valex answer and some custom
select MAX(CAST(SUBSTR(num,INSTR(num,'/')+9,1000) as Int))
from T
where num like TO_CHAR(sysdate,'%/YYYY/MM/%')
So this, count searching first occurence.
select MAX(CAST(SUBSTR(num,INSTR(num,'/',1 ,n)+1,1000) as Int))
from T
where num like TO_CHAR(sysdate,'%/YYYY/MM/%')
This found the n occurence of the char.
This is a helpful solution in other cases.
To get a maximum you should convert this last part into INT values otherwise you can get not right results because of STRING comparing rules will be used.
As soon as /YYYY/MM/ has got a fixed length = 9 so we can find first \ position and add 9 to this position to find a last part number substring start.
Here is an example:
select MAX(CAST(SUBSTR(num,INSTR(num,'/')+9,1000) as Int))
from T
where num like TO_CHAR(sysdate,'%/YYYY/MM/%')
SQLFiddle demo
Also you can exclude wrong formatted values from this query to avoid conversion errors using the following way:
select MAX(CAST(SUBSTR(num,INSTR(num,'/')+9,1000) as Int))
from T
where num like TO_CHAR(sysdate,'%/YYYY/MM/%')
AND
LENGTH(TRIM(TRANSLATE(SUBSTR(num,INSTR(num,'/')+9,1000),
' 0123456789', ' '))) is null
SQLfiddle demo
Try this:
SELECT
MAX(SUBSTR(num, INSTR(num, '/', 1, 3) + 1))
FROM ref
WHERE
SUBSTR(num, INSTR(num, '/', 1, 2) + 1, INSTR(num, '/', 1, 3) - INSTR(num, '/', 1, 2) - 1) = TO_CHAR(sysdate, 'MM')
Sample: http://sqlfiddle.com/#!4/1b03a/1