SQL Concatenate Distinct Values - sql

I have the following query where I am concatenating a list of products and quotes by account and version number.
SELECT DISTINCT ST2.Account_No, ST2.version_num,
substring((SELECT ',' + ST1.ProductNo AS [text()]
FROM (SELECT DISTINCT Account_No, version_num, ProductNo, QuoteNo, RowNo
FROM uAccountProductInfo) ST1
WHERE ST1.version_num = ST2.version_num
AND ST1.Account_No = ST2.Account_No
ORDER BY ST1.RowNo, ST1.Account_No,ST1.version_num
FOR XML PATH (''))
, 2, 1000) [AllProduct]
,
substring((SELECT ','+ ST3.QuoteNo AS [text()]
FROM (SELECT DISTINCT Account_No, version_num, ProductNo, QuoteNo, RowNo
FROM [uAccountProductInfo]) ST3
WHERE ST3.version_num = ST2.version_num
AND ST3.Account_No = ST2.Account_No
ORDER BY ST3.RowNo, ST3.version_num
FOR XML PATH (''))
, 2, 1000) [AllQuote]
FROM uAccountProductInfo ST2
The problem I am experiencing is that the return values are not showing the distinct results. I understand the reason it's happening but cannot figure out how to adjust it.
Return results look this:
Account version_num AllProduct AllQuote
1 2 aaa,aaa,aaa 111,111,111
1 3 aaa,aaa,bbb 111,111,222
What I want is
Account version_num AllProduct AllQuote
1 2 aaa, 111
1 3 aaa,bbb 111,222
Test Data would be this:
Account version_num LOB Package Product Quote RowNo
1 2 GL 1 aaa 111 1
1 2 AU 1 aaa 111 2
1 2 PF 1 aaa 111 3
1 3 GL 1 aaa 111 1
1 3 AU 1 aaa 111 2
1 3 WK 0 bbb 222 3
The reason they are returning with multiple instances of the same product | quote is due to the inclusion of RowNo column. I had this excluded before which returned the distinct list of values, but I need to order by RowNo so that the values come in a specific order.
I have been wracking my brain all morning but cannot figure out how to adjust the query to only return the distinct values at the top level.
Any suggestions?
n.b. - this is part of a larger query but once this subquery is resolved it should flow into main one just fine (at least I think). I can post main query if people need.

I much prefer stuff() rather than substring() for removing the separating character. What you require, though, is select distinct or group by in the subquery:
stuff((SELECT ',' + ST1.ProductNo AS [text()]
FROM uAccountProductInfo ST1
WHERE ST1.version_num = ST2.version_num AND
ST1.Account_No = ST2.Account_No
GROUP BY ST1.ProductNo
ORDER BY MIN(ST1.RowNo)
FOR XML PATH ('')
), 1, 1, '')
Your additional subquery is superfluous. In fact, it is misleading because you are using SELECT DISTINCT and expect it to return one row per ProductNo -- even when multiple rows exist with different values in the other columns.
Note that the ordering is unclear. This bases it on the minimum RowNo.

Related

How to Condense SQL Rows Using Stuff?

I have a table of data that looks like:
#test
RecordID
Name
hasSpanishVersion
Type
TypeID
1
Test One
Yes
FormType1
1
1
Test One
Yes
FormType2
2
3
Test Three
No
null
null
4
Test Four
Yes
FormType3
3
5
Test Five
Yes
FormType3
3
I also have another table that looks like:
#formTypes
TypeID
FormType
1
FormType1
2
FormType2
3
FormType3
What I am trying to do is condense the Type column where there are like-RecordIDs / Names. If "hasSpanishVersion" is null, the following two columns will also be null.
I am wanting the example table to look like:
RecordID
Name
hasSpanishVersion
Type
1
Test One
Yes
FormType1, FormType2
3
Test Three
null
null
4
Test Four
Yes
FormType3
5
Test Five
Yes
FormType3
I have tried the following code, but this only takes all of the FormTypes and condenses them for each of the three different types:
SELECT
*,
STUFF((SELECT '; ' + t.formTypeSpanish
FROM #test t
WHERE t.TypeID = ft.TypeID
FOR XML PATH('')), 1, 1, '') as FormTypes
FROM #formTypes ft
GROUP BY ft.TypeID, ft.FormType
ORDER BY 1
You might let your grouping column put in Where in a correlated subquery that and you concatenate value in SELECT
SELECT
RecordID, Name,hasSpanishVersion,
STUFF((SELECT ',' + tt.[Type]
FROM formTypes tt
WHERE
tt.RecordID = t1.RecordID AND
tt.Name = t1.Name AND
tt.hasSpanishVersion = t1.hasSpanishVersion
FOR XML PATH('')), 1, 1, '') as FormTypes
FROM formTypes t1
GROUP BY RecordID, Name,hasSpanishVersion
ORDER BY 1
if your sql-server support STRING_AGG there is another simple way to do that.
SELECT RecordID, Name,hasSpanishVersion,STRING_AGG([Type] ,',')
FROM formTypes
GROUP BY RecordID, Name,hasSpanishVersion
sqlfiddle

Concatenate multiple rows of data into 1 column, but grouping it based on two primary keys

can anyone help me combine rows of stock location into 1 column
based on branch and productid primary keys
im not sure what i can try as the code below just runs and runs
do i need to self join to get the result? is there a better method than below?
before
branch productid stock-location
1 5 b5
1 3 b2
1 5 b4
after
branch productid stock-location
1 5 b5,b4
1 3 b2
here is my very bad code
SELECT Main.zone,
LEFT(Main.zone,Len(Main.zone)-1) As "ZONE_CODE"
FROM
(
SELECT DISTINCT ST2.LOCATION_NBR,
(
SELECT ST1.ZONE_CODE + ',' AS [text()]
FROM dbo.OPAL_ZONE ST1
WHERE ST1.LOCATION_NBR = ST2.LOCATION_NBR
ORDER BY ST1.LOCATION_NBR
FOR XML PATH ('')
) zone
FROM dbo.OPAL_ZONE sT2
) [Main]
I figured it out, sorry for the dud question, but i hope my answer can help someone else
Main.Zone_Loc As "ZONE_CODE"
FROM
(
SELECT DISTINCT ST2.LOCATION_NBR, st2.PRODUCT_CODE,
(
SELECT ST1.ZONE_CODE + ',' AS [text()]
FROM dbo.OPAL_ZONE ST1
WHERE ST1.LOCATION_NBR = ST2.LOCATION_NBR and ST1.PRODUCT_CODE = sT2.PRODUCT_CODE
ORDER BY ST1.LOCATION_NBR
FOR XML PATH ('')
) [Zone_Loc]
FROM dbo.OPAL_ZONE sT2
) [Main]

Get the mentioned output by SQL query

I appeared for an interview lately. The interviewer asked me the problem.
I have 2 tables:
First table is Location like this:
ID | City
---+-----------
1 | Mumbai
2 | Delhi
3 | Bangalore
Second table is Item like this:
Item | Location_id
-----+-------------
A | 1,2
B | 2,3
C | 1,2,3
Now we want the output as below
Item | Location
-----+-------------------------
A | Mumbai,Delhi
B | Delhi,Bangalore
C | Mumbai,Delhi,Bangalore
Please help me write the query.
You can use below query . String split works on SQL server 2016 and later versions.
Select * into #temp
from #location l
join (
select item,value from #item
CROSS APPLY STRING_SPLIT(location_id, ',')
) A on l.id=a.value
select
item,
stuff((
select ',' + u.city
from #temp u
where u.item = A.item
for xml path('')
),1,1,'') as List
from #temp A
group
by item
Drop table #temp
You want to STUFF all matching results into one column:
SELECT i.item,
STUFF((SELECT ',' + l.city
FROM location l
WHERE CHARINDEX(cast(l.id AS VARCHAR(4)),i.location_id)>0
FOR XML PATH ('')), 1, 1, '')
AS Location
FROM item i;
CHARINDEX finds the first instance of a substring in a string, in this case it finds that you have the number 1 for Mumbai, 2 for Dehli etc in your string of location_id.
STUFF combines your results into a single result.
SQLFiddle
UPDATE -
As pointed out by Rajneesh, this only works because your IDs are single digit. String splitting is probably the best way to handle the possibility of such IDs. That can still be done within this one query, without the need for a temp table.
SELECT i.item,
STUFF((SELECT ',' + l.city
FROM location l
WHERE l.id IN (select value from STRING_SPLIT(i.location_id, ','))
FOR XML PATH ('')), 1, 1, '')
AS Location
FROM item i;
SQLFiddle

Returning One Row with Multiple Column Positions as a List

My problem seems pretty straight forward, but I'm struggling to grasp an easier way of fulfilling the required output.
Problem: I have entries in a table. These entries are identified by an ID column. They are joined with a table that shows a particular volunteer position that they hold or have held. If they are identified as having more than one position, their record returns multiple times.
Solution: I would like to list their positions in one column so that their entry based on the ID column returns only once. I have tried using STUFF and CONCAT, but all they return are multiples of the same position held for each entry as opposed to finding out if they have multiple positions and listing them, which means I'm probably using those functions incorrectly.
Here is the current output:
ID FIRST_NAME current_flag Position
---------- -------------------- ------------ ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
106379 Christine 1 NOMAL
106379 Christine 1 SFC
106418 Mary 1 CVP
106751 Denise 1 SDFA
106885 Marianna 1 RCRA
107244 Jennifer 1 RCF
Here is the desired output:
ID FIRST_NAME current_flag Position
---------- -------------------- ------------ ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
106379 Christine 1 NOMAL,SFC
106418 Mary 1 CVP
106751 Denise 1 SDFA
106885 Marianna 1 RCRA
107244 Jennifer 1 RCF
I have removed extraneous columns because the output is not conducive to showing what the desired output would be, but they are included in my code as follows.
Current Code:
SELECT DISTINCT Name.ID,
Name.FIRST_NAME,
Name.LAST_NAME,
Name_Address.ADDRESS_1,
Name_Address.CITY,
Name_Address.STATE_PROVINCE,
Name_Address.ZIP,
Name_Address.PREFERRED_MAIL,
vcsiboCurrentCommittee.current_flag,
CONVERT(VARCHAR(10), Name.BIRTH_DATE, 101) AS BirthDate,
vcsiboCurrentCommittee.Position
FROM Name INNER JOIN
Name_Address ON Name.ID = Name_Address.ID INNER JOIN
vcsiboCurrentCommittee ON Name.ID = vcsiboCurrentCommittee.ID
WHERE (Name_Address.PREFERRED_MAIL = 1.00)
AND (vcsiboCurrentCommittee.current_flag = 1.00)
AND (Name.ID <> 10) AND (Name.BIRTH_DATE <> '')
AND (vcsiboCurrentCommittee.CommitteeCode IN
('ALUMNAE_DEPT', 'COLLEGIATE-DEPT', 'EDUCATION', 'FINANCIAL', 'INTERN_COUNCIL', 'IPDEPT', 'MEMBERSHIP','PANHELLENIC', 'REG_1', 'REG_2', 'REG_3', 'REG_4', 'REG_5', 'REG_6', 'REG_7', 'REG_8'))
Code using STUFF:
SELECT DISTINCT Name.ID,
Name.FIRST_NAME,
Name.LAST_NAME,
Name_Address.ADDRESS_1,
Name_Address.CITY,
Name_Address.STATE_PROVINCE,
Name_Address.ZIP,
Name_Address.PREFERRED_MAIL,
vcsiboCurrentCommittee.current_flag,
CONVERT(VARCHAR(10), Name.BIRTH_DATE, 101) AS BirthDate,
STUFF(
(SELECT
CAST(',' AS varchar(max)) + vcsiboCurrentCommittee.Position
FROM vcsiboCurrentCommittee AS vcc
WHERE vcc.ID = Name.ID
ORDER BY vcc.Position
FOR xml path('')
), 1, 1, '') AS Positions,
vcsiboCurrentCommittee.Position
FROM Name INNER JOIN
Name_Address ON Name.ID = Name_Address.ID INNER JOIN
vcsiboCurrentCommittee ON Name.ID = vcsiboCurrentCommittee.ID
WHERE (Name_Address.PREFERRED_MAIL = 1.00) AND
(vcsiboCurrentCommittee.current_flag = 1.00)
AND (Name.ID <> 10) AND (Name.BIRTH_DATE <> '')
AND (vcsiboCurrentCommittee.CommitteeCode IN ('ALUMNAE_DEPT', 'COLLEGIATE-DEPT', 'EDUCATION', 'FINANCIAL', 'INTERN_COUNCIL', 'IPDEPT', 'MEMBERSHIP', 'PANHELLENIC', 'REG_1', 'REG_2', 'REG_3', 'REG_4', 'REG_5', 'REG_6', 'REG_7', 'REG_8'))
That creates this ouput:
ID FIRST_NAME current_flag Positions Position
---------- -------------------- ------------ ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
106379 Christine 1 NOMAL,NOMAL,NOMAL,NOMAL,NOMAL,NOMAL,NOMAL,NOMAL,NOMAL,NOMAL NOMAL
106379 Christine 1 SFC,SFC,SFC,SFC,SFC,SFC,SFC,SFC,SFC,SFC SFC
106418 Mary 1 CVP,CVP,CVP,CVP,CVP,CVP,CVP,CVP,CVP,CVP,CVP,CVP,CVP CVP
106751 Denise 1 SDFA,SDFA,SDFA,SDFA,SDFA,SDFA,SDFA,SDFA,SDFA,SDFA,SDFA
Thanks in advance.
SQL Server 2012
I think you need to fix the alias in the subquery and perhaps select distinct:
STUFF((SELECT DISTINCT CAST(',' AS varchar(max)) + vcc.Position
---------------------------------------------------^
FROM vcsiboCurrentCommittee AS vcc
WHERE vcc.ID = Name.ID
ORDER BY vcc.Position
FOR xml path('')
), 1, 1, '') AS Positions,
Once you fix the alias (so Position refers to the inner table rather than the outer table), then the DISTINCT might not be needed. If you do use DISTINCT, the ORDER BY might need to be adjusted as well.
EDIT:
The duplicates in the outer query are created by the logic in the outer query. You probably don't want the same table there as in the subquery. It is very hard to tell what logic you really want. Probably some sort of aggregation on vcsiboCurrentCommittee before joining it to other tables.

Combining Data From Multiple Rows

I have 3 tables I am writing a query for: Memos, Memos_Description, Policies. The database was not designed by myself and I cannot change it, this is simply a report.
I currently have a query that seems to be working, but is extremely inefficient before joining the extra tables that I need.
SELECT
Main.CLIENTSNAME,
Main.ENTRYDATE,
Main.AUTHOR,
Main.POLICYNUMBER,
Main.CLIENTS_ID,
Main.MEMOS_ID,
Left(Main.DESCRIPTION,Len(Main.DESCRIPTION)) AS REGARDING
FROM
(
SELECT distinct ST1.MEMOS_ID,
(
SELECT ST2.DESCRIPTION + ' ' AS [text()]
FROM dbo.MEMOS_DESCRIPTION ST2
WHERE ST1.MEMOS_ID = ST2.MEMOS_ID
ORDER BY ST1.MEMOS_ID
For XML PATH ('')
) [DESCRIPTION],
ST1.CLIENTSNAME,
ST1.ENTRYDATE,
ST1.AUTHOR,
ST1.POLICYNUMBER,
ST1.REGARDING,
ST1.CLIENTS_ID
FROM dbo.MEMOS ST1
) [Main]
The tables look like this:
tbl.MEMOS
MEMOS_ID
POLICIES_ID
CLIENTSNAME
tbl.MEMOS_DESCRIPTION
MEMOS_ID
DESCRIPTION
tbl.POLICIES
POLICIES_ID
POLICYNUMBER
The data looks like this:
tbl1.MEMOS_ID | tbl1.CLIENTSNAME
1 PERSON ONE
2 PERSON TWO
3 PERSON THREE
tbl2.MEMOS_ID | tbl2.DESCRIPTION
1 This is a sentence
1 that can run over more
1 than one description record.
2 Person two has
2 something different.
3 Client Created.
tbl3.POLICIES_ID | tbl3.POLICYNUMBER
123 ABCDE
456 FGHIJ
I would like the report to look like:
tbl1.MEMOS_ID | tbl1.CLIENTSNAME | tbl2.DESCRIPTION | tbl3.POLICIES_ID | tbl3.POLICYNUMBER
1 PERSON ONE This is a sentence that can run over more tan one description record. 123 ABCDE
I hope this makes sense and thank you.
Updated Query as per Gordon's suggested answer:
SELECT ST1.*,
STUFF(
(SELECT ' ' + ST2.DESCRIPTION AS [text()]
FROM dbo.MEMOS_DESCRIPTION ST2
WHERE ST1.MEMOS_ID = ST2.MEMOS_ID
ORDER BY ST1.MEMOS_ID
For XML PATH ('')
), 1, 1, '') [REGARDING]
FROM
(SELECT DISTINCT ST1.MEMOS_ID,
ST1.CLIENTSNAME,
ST1.ENTRYDATE,
ST1.AUTHOR,
ST1.POLICYNUMBER,
ST1.CLIENTS_ID,
ST1.POLICIES_ID
FROM dbo.MEMOS ST1
) ST1
LEFT JOIN POLICIES B
ON ST1.POLICIES_ID = B.POLICIES_ID
WHERE ST1.ENTRYDATE >= DATEADD(month, -2, GETDATE())
AND (B.PROD1 = ('123') OR B.PROD1 = ('456') OR B.PROD1 = ('789'))
How does the performance compare if you do the select distinct before doing the string aggregation?
SELECT ST1.*,
STUFF((SELECT ' ' + ST2.DESCRIPTION AS [text()]
FROM dbo.MEMOS_DESCRIPTION ST2
WHERE ST1.MEMOS_ID = ST2.MEMOS_ID
ORDER BY ST1.MEMOS_ID
For XML PATH ('')
), 1, 1, '') ) [DESCRIPTION]
FROM (SELECT DISTINCT T1.MEMOS_ID, ST1.CLIENTSNAME, ST1.ENTRYDATE,
ST1.AUTHOR, ST1.POLICYNUMBER, ST1.REGARDING, ST1.CLIENTS_ID
FROM dbo.MEMOS ST1
) ST1
I suspect that SQL Server might be doing the string aggregation for every row before running distinct -- and that is a lot of unnecessary work.