100字范文 > oracle中fn_getpy函数 SQL Server根据汉字笔划和取得拼音首字母进行排序

oracle中fn_getpy函数 SQL Server根据汉字笔划和取得拼音首字母进行排序

时间：2021-01-17 21:23:37

select word

from #t1 a

left join #t1 b on a.id=b.id-1 and a.code

where b.code is null

order by a.id

得到

个汉字，每个汉字都是每种笔划数按

chinese_prc_stroke_cs_as_ks_ws

排序规则排序后的

最后一个汉字：

亅阝马风龙齐龟齿鸩龀龛龂龆龈龊龍龠龎龐龑龡龢龝齹龣龥齈龞麷鸞麣龖龗齾齉龘

上面可以看出：

“

亅

”

是所有一笔汉字排序后的最后一个字，

“

阝

”

是所有二笔汉字排序后的最后

一个字

......

等等。

但同时也发现，从第

个汉字

“

龗

(33

笔

)”

后面的笔划有些乱，不正确。但没关系，比

“

龗

”

笔划

多的只有四个汉字，我们手工加上：齾

笔，齉

笔，靐

笔，龘

笔

建汉字笔划表(

tab_hzbh

)：

create table tab_hzbh(id int identity,cnword nchar(1))

先插入前

个汉字

insert tab_hzbh

select top 33 word

from #t1 a

left join #t1 b on a.id=b.id-1 and a.code

where b.code is null

order by a.id

再加最后四个汉字

set identity_insert tab_hzbh on

insert tab_hzbh(id,cnword)

select 35,n

齾

union all select 36,n

齉

union all select 39,n

靐

union all select 64,n

龘

set identity_insert tab_hzbh off

到此为止，我们可以得到结果了，比如我们想得到汉字

“

国

”

的笔划：

declare @a nchar(1)

set @a=

国

select top 1 id

from tab_hzbh

where cnword>=@a collate chinese_prc_stroke_cs_as_ks_ws

order by id

-----------

(

结果：汉字

“

国

”

笔划数为

上面所有准备过程，只是为了写下面这个函数，这个函数撇开上面建的所有临时表和固

定表，为了通用和代码转移方便，把表

tab_hzbh

的内容写在语句内，然后计算用户输入一串

汉字的总笔划：

create function fun_getbh(@str nvarchar(4000))

returns int

begin

declare @word nchar(1),@n int

set @n=0

while len(@str)>0

begin

set @word=left(@str,1)

如果非汉字，笔划当

计

set @n=@n+(case when unicode(@word) between 19968 and 19968+20901

then (select top 1 id from (

select 1 as id,n

亅

as word

union all select 2,n

阝

union all select 3,n

马

union all select 4,n

风

union all select 5,n

龙

union all select 6,n

齐

union all select 7,n

龟

union all select 8,n

齿

union all select 9,n

鸩

union all select 10,n

龀

union all select 11,n

龛

union all select 12,n

龂

union all select 13,n

龆

union all select 14,n

龈

union all select 15,n

龊

union all select 16,n

龍

union all select 17,n

龠

union all select 18,n

龎

union all select 19,n

龐

union all select 20,n

龑

union all select 21,n

龡

union all select 22,n

龢

union all select 23,n

龝

union all select 24,n

齹

union all select 25,n

龣

union all select 26,n

龥

union all select 27,n

齈

union all select 28,n

龞

union all select 29,n

麷

union all select 30,n

鸞

union all select 31,n

麣

union all select 32,n

龖

union all select 33,n

龗

union all select 35,n

齾

union all select 36,n

齉

union all select 39,n

靐

union all select 64,n

龘

) t

where word>=@word collate chinese_prc_stroke_cs_as_ks_ws

order by id asc) else 0 end)

set @str=right(@str,len(@str)-1)

end

return @n

end

函数调用实例：

select dbo.fun_getbh(

中华人民共和国

),dbo.fun_getbh(

中華人民共和國

)

执行结果：笔划总数分别为

和

，简繁体都行。

当然，你也可以把上面

“union

all”

内的汉字和笔划改存在固定表内，在汉字

列建

clustered index

，列排序规则设定为：

chinese_prc_stroke_cs_as_ks_ws

这样速度更快。如果你用的是

big5

码的操作系统，你得另外生成汉字，方法一样。

但有一点要记住：这些汉字是通过

sql

语句

select

出来的，不是手工输入的，更不

是查字典得来的，因为新华字典毕竟不同于

unicode

字符集，查字典的结果会不正

确。

用排序规则的特性得到汉字拼音首字母

用得到笔划总数相同的方法，我们也可以写出求汉字拼音首字母的函数。如下：

create function fun_getpy(@str nvarchar(4000))

returns nvarchar(4000)

begin

declare @word nchar(1),@py nvarchar(4000)

set @py=

while len(@str)>0

begin

set @word=left(@str,1)

如果非汉字字符，返回原字符

set @py=@py+(case when unicode(@word) between 19968 and 19968+20901

then (select top 1 py from (

select a as py,n

驁

as word

union all select b,n

簿

union all select c,n

錯

union all select d,n

鵽

union all select e,n

樲

union all select f,n

鰒

union all select g,n

腂

union all select h,n

夻

union all select j,n

攈

union all select k,n

穒

union all select l,n

鱳

union all select m,n

旀

union all select n,n

桛

union all select o,n

漚

union all select p,n

曝

union all select q,n

囕

union all select r,n

鶸

union all select s,n

蜶

union all select t,n

籜

union all select w,n

鶩

union all select x,n

鑂

union all select y,n

韻

union all select z,n

咗

) t

where word>=@word collate chinese_prc_cs_as_ks_ws

order by py asc) else @word end)

set @str=right(@str,len(@str)-1)

end

return @py

end

函数调用实例：

select dbo.fun_getpy(

中华人民共和国

),dbo.fun_getpy(

中華人民共和國

)

结果都为：

zhrmghg

也可用相同的方法，扩展为得到汉字全拼的函数，甚至还可以得到全拼的读音声调，不过全拼分类大多了。得到全拼最好是用对照表，两万多汉字搜索速度很快，用对照表还可以充分利用表的索引。

本内容不代表本网观点和政治立场，如有侵犯你的权益请联系我们处理。

网友评论

网友评论仅供其表达个人看法，并不表明网站立场。