如何在Oracle中查找表中的重复值?

在Oracle数据库表中返回给定列的重复值及其出现次数的最简单SQL语句是什么?

例如:我有一个JOBS表,列为JOB_NUMBER。我怎样才能知道我是否有任何重复的__abc1,以及它们重复了多少次?

802748 次浏览

按COUNT聚合列,然后使用HAVING子句查找出现次数大于一次的值。

SELECT column_name, COUNT(column_name)
FROM table_name
GROUP BY column_name
HAVING COUNT(column_name) > 1;

我能想到的最简单的:

select job_number, count(*)
from jobs
group by job_number
having count(*) > 1;

如何:

SELECT <column>, count(*)
FROM <table>
GROUP BY <column> HAVING COUNT(*) > 1;

要回答上面的例子,它看起来像:

SELECT job_number, count(*)
FROM jobs
GROUP BY job_number HAVING COUNT(*) > 1;

select count(j1.job_number), j1.job_number, j1.id, j2.id
from   jobs j1 join jobs j2 on (j1.job_numer = j2.job_number)
where  j1.id != j2.id
group by j1.job_number

将给出复制行的id。

另一种方法:

SELECT *
FROM TABLE A
WHERE EXISTS (
SELECT 1 FROM TABLE
WHERE COLUMN_NAME = A.COLUMN_NAME
AND ROWID < A.ROWID
)

column_name上有索引时工作正常(足够快)。它是删除或更新重复行的更好方法。

如果您不需要知道重复的实际数量,则甚至不需要在返回列中显示计数。如。

SELECT column_name
FROM table
GROUP BY column_name
HAVING COUNT(*) > 1

在多个列标识唯一行的情况下(例如关系表),你可以使用以下

使用行id 例如emp_dept(empid, deptid,开始日期,结束日期) 假设empid和deptid是唯一的,并且在

的情况下标识行
select oed.empid, count(oed.empid)
from emp_dept oed
where exists ( select *
from  emp_dept ied
where oed.rowid <> ied.rowid and
ied.empid = oed.empid and
ied.deptid = oed.deptid )
group by oed.empid having count(oed.empid) > 1 order by count(oed.empid);

如果这样的表有主键,那么使用主键而不是rowid,例如id是pk那么

select oed.empid, count(oed.empid)
from emp_dept oed
where exists ( select *
from  emp_dept ied
where oed.id <> ied.id and
ied.empid = oed.empid and
ied.deptid = oed.deptid )
group by oed.empid having count(oed.empid) > 1 order by count(oed.empid);
SELECT   SocialSecurity_Number, Count(*) no_of_rows
FROM     SocialSecurity
GROUP BY SocialSecurity_Number
HAVING   Count(*) > 1
Order by Count(*) desc

你也可以尝试这样的方法在一个表中列出所有重复的值,比如reqitem

SELECT count(poid)
FROM poitem
WHERE poid = 50
AND rownum < any (SELECT count(*)  FROM poitem WHERE poid = 50)
GROUP BY poid
MINUS
SELECT count(poid)
FROM poitem
WHERE poid in (50)
GROUP BY poid
HAVING count(poid) > 1;

1. 解决方案

select * from emp
where rowid not in
(select max(rowid) from emp group by empno);

我通常使用甲骨文的分析函数ROW_NUMBER ()

假设你想检查关于列(c1c2c3)上构建的唯一索引或主键的重复。 然后你将这样做,调出ROWID行,其中ROW_NUMBER()带来的行数是>1:

Select *
From Table_With_Duplicates
Where Rowid In (Select Rowid
From (Select ROW_NUMBER() Over (
Partition By c1, c2, c3
Order By c1, c2, c3
) nbLines
From Table_With_Duplicates) t2
Where nbLines > 1)

下面是一个SQL请求:

select column_name, count(1)
from table
group by column_name
having count (column_name) > 1;

我知道这是一个老帖子,但这可能会帮助一些人。

如果你需要打印表的其他列,同时检查重复使用如下:

select * from table where column_name in
(select ing.column_name from table ing group by ing.column_name having count(*) > 1)
order by column_name desc;

如果需要,还可以在where子句中添加一些额外的过滤器。