的方法有很多在SQL Server中拆分字符串。本文介紹了幾乎每一個方法的優點和缺點:"Arrays and Lists in SQL Server 2005 and Beyond, When Table Value Parameters Do Not Cut it" by Erland Sommarskog
I prefer the number table approach to split a string in TSQL,此方法的工作,你需要做的這一個時間表設置:
SELECT TOP 10000 IDENTITY(int,1,1) AS Number
INTO Numbers
FROM sys.objects s1
CROSS JOIN sys.objects s2
ALTER TABLE Numbers ADD CONSTRAINT PK_Numbers PRIMARY KEY CLUSTERED (Number)
一旦Numbers表格設置,創建此分割功能:
CREATE FUNCTION [dbo].[FN_ListToTable]
(
@SplitOn char(1) --REQUIRED, the character to split the @List string on
,@List varchar(8000)--REQUIRED, the list to split apart
)
RETURNS TABLE
AS
RETURN
(
----------------
--SINGLE QUERY-- --this will not return empty rows
----------------
SELECT
ListValue
FROM (SELECT
LTRIM(RTRIM(SUBSTRING(List2, number+1, CHARINDEX(@SplitOn, List2, number+1)-number - 1))) AS ListValue
FROM (
SELECT @SplitOn + @List + @SplitOn AS List2
) AS dt
INNER JOIN Numbers n ON n.Number < LEN(dt.List2)
WHERE SUBSTRING(List2, number, 1) = @SplitOn
) dt2
WHERE ListValue IS NOT NULL AND ListValue!=''
);
GO
您現在可以輕鬆地拆分CSV字符串轉換成表格,並加入就可以了:
select * from dbo.FN_ListToTable(',','1,2,3,,,4,5,6777,,,')
OUTPUT:
ListValue
-----------------------
1
2
3
4
5
6777
(6 row(s) affected)
你現在可以加入到您的CSV像分割:
DECLARE @YourTable table (RowID int, RowValue varchar(200))
INSERT INTO @YourTable VALUES (1,'aaa bbb ccc ddd eee fff ggg hhh')
INSERT INTO @YourTable VALUES (2,'bbb ddd fff hhh')
INSERT INTO @YourTable VALUES (3,'aaa bbb zzz')
DECLARE @Words varchar(500)
SET @Words='aaa,bbb,ccc,zzz'
SELECT
COUNT(y.RowID) AS CountOF,l.ListValue
FROM @YourTable y
INNER JOIN dbo.FN_ListToTable(',',@Words) AS l ON y.RowValue LIKE '%'+l.ListValue+'%'
GROUP BY l.ListValue
OUTPUT:
CountOF ListValue
----------- ---------------
2 aaa
3 bbb
1 ccc
1 zzz
(4 row(s) affected)
也是其值得一提的是SQL Server Express的2008 R2與高級服務(免費)包括全文搜索 - 以防萬一Express Edition目前是您不使用全文搜索的限制。 – cfeduke 2010-05-04 20:50:28
@cfeduke感謝您的額外細節。這個特定的應用程序正在使用MSSQL 2005企業版,所以不用擔心什麼是或不包括在內。 @Mark Byers這看起來最有前途,所以今天我會玩。 – 2010-05-05 14:19:10
所以這工作得很好,在我想出瞭如何用全文搜索做到這一點之後 - 除了遺漏了我知道頻繁出現的單詞。我相信這是因爲噪音詞的刪除(它刪除的詞的例子:A,I,Have,Did)......但是我想**保留它們!它們對我們正在進行的研究很重要。有任何想法嗎? – 2010-05-07 14:56:52