Excel - remove duplicate rows from a list - excel-2007

I have not got much experience in Excel.Any help appreciated.
I have 10-11 csv files , that can be opened in excel. I have duplicate records in all the files.I need to count the unique records in all the files that matches column3(for e.g).
cheers

You could link the CSV files from an Access database, then UNION all the tables, that would get rid of any duplicate rows, then a simple query on the UNION would allow you to count rows with specific criteria.
Does that make sense?

try using this formula to identify duplicate and non-duplicate records. Here I'm assuming that all of your data is on one sheet, and the unique value to check for duplicate-iness is col3. Also, Im checking all the way down to row 2000. If you have more rows, change the range in the formula.
=IF(MAX(COUNTIF($C$2:$C$2000,$C$2:$C$2000))>1,"Duplicates","No Duplicates")
No all you gotta do is count how many Duplicates and how many No Duplicates you have.

Related

MS Access/SQL--Transitioning Columns to Rows for Dynamic Data

I have a document list that is updated periodically. The list may have up to 5 associated doc numbers associated with it. I need to generate a report that will have the primary document listed and the other 'sub documents' listed as well.
I basically need a table as show in the picture. I attempted a UNION, but if the data is dynamic, I would also continually need to update the query.
Any help or direction with this concept would be greatly appreciated.
I do not see other solution but read the recordset row by row and insert the data into new table. PIVOT would not work in this case.

How to Stack a range of values (from multiple tables in another sheet) into a single column

I'm working on a quarterly report that Auto-generates all fields.
I could really use some help building a formula that pulls values from the first column ([T6-TOC]) of three separate tables (ROVH_Jan, ROVH_Feb, ROVH_MAR) existing in another worksheet (RVH 1825). I need the three ranges of values to stack in a single column, but I do not want to eliminate duplicates values.
I've tried using =INDEX formula, and VBA but I can't get the syntax right.
Any suggestions?
These are sources I've viewed but didn't solve my problem.
https://superuser.com/questions/445410/pull-row-of-data-from-one-place-in-spreadsheet-to-another
http://forum.chandoo.org/threads/merge-stack-multiple-named-ranges-across-multiple-worksheets-in-a-master-sheet.11074/
Excel - Combine multiple columns into one column
http://www.mrexcel.com/forum/excel-questions/610527-how-do-i-stack-data-multiple-columns-into-one-column.html
Something like this should work for you:
=IF(ROW(A1)<=ROWS(ROVH_Jan),INDEX(ROVH_Jan[T6-TOC],ROW(A1)),IF(ROW(A1)<=ROWS(ROVH_Jan)+ROWS(ROVH_Feb),INDEX(ROVH_Feb[T6-TOC],ROW(A1)-ROWS(ROVH_Jan)),IF(ROW(A1)<=ROWS(ROVH_Jan)+ROWS(ROVH_Feb)+ROWS(ROVH_MAR),INDEX(ROVH_MAR[T6-TOC],ROW(A1)-ROWS(ROVH_Jan)-ROWS(ROVH_Feb)),"")))

Checking for same value in rows and calculate corresponding total

I have a large s/sheet. Values in column A correspond to values in column B, C & D.
I need to combine some rows which have same value in column A and automatically calculates total of value in column B in all corresponding rows.
Then i need to delete all unnecessary rows
Any ideas how i can do this with some code?
I think that you can use Power Query or VBA. Probably you will be able to achieve this with formulas, but it will be not flexible. With Power QUery you can combine data from multiple sources, clean and transform and even load directly into PowerPivot model. If you will have some detailed information please let me know. If you can upload the sample workbook with your data i will be able to provide you some more information.

best aproach to distinct query result before importing it into Excel cells

I have a query which grabs data from Access with contents:
A,1,z
A,2,z
B,1,y
A,1,i
I created 3 dropdowns in excel and I want to populate them with rows from these columns. The problem is that in each column there are duplicates and I want to get rid of them. I am looking for a solution to get rid of these duplicates.
Current process is following:
run a query. I use VBA in Excel. Access is being queried.
paste results into a separate sheet (it takes A LOT of time because there are 20k reocrds)
assign a range for my dropdows
As you can see my second step is very resource hog and the proccess time should drop drastically by removing duplicates from each column.
What is the best approach to populate dropdowns with unique values from query?
My ideas
create a query which will output me:
A,1,z
B,2,y
, ,i
In that case i will not have to distinct values manually in excel. Not possible to do, as I understand...
Add values from each columns to List, remove doubles, paste result into excel. This is my personal favorite because I see no other ways to fix the issue.
create multiple queries to DISTINCT each table_column separately.. Not very fast solution, I suppose
some other approach
Run three SELECT DISTINCT Colx FROM table queries to get the values for the three dropdowns. Your option two. The work has to be done sometime so you might as well use the tools designed to do the work instead of reinventing that wheel.
If you use select distinct columna, columnb... instead of select columna, columnb... you will get what you want. The SQL will take longer to run, though.

Find number of Unique Values per Item in Column B

I have a Column B with part numbers. These may repeat and can occur in the same warehouse, (Column A) and also different warehouses. I want to find the amount of times any particular part number appears in a unique warehouse.
I have tried a PivotTable and do not know if I need a macro or just a formula.
How might I achieve this?
Sorry, I think Remove Duplicates and Filter Advanced may not be your best bet. But as I understand your requirement a PT should suit: