#1, posted 2016-08-04 17:28

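My initial approach was:
read the whole file, split it on whitespace into an array, then walk the array to build a hash. But if the article is long, or there are several articles,
storing everything in an array first seems wasteful. How can I change this so that the hash is built directly while the file is being read, without a temporary array?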
text_in:
The U.N. Food and Agriculture Organization says it has less than half the funding it needs to help ensure food security in parts of South Sudan.
.......
(The rest is too long to paste; assume the text is well-formed.)

The goal is to build the following hash, %Words:
(
The => 1,
U.N. => 1,
Food => 1,
...
)

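My earlier idea was:
my $content;

{
    local $/ = undef;      # slurp mode: read the whole file in one go
    $content = <$IN1>;     # $IN1 is assumed to be opened earlier (not shown)
    close($IN1);
    #print "$content\n";
}

# split ' ' splits on runs of whitespace, so blank lines do not yield empty words
my @words1 = split ' ', $content;
my %Words1 = map { $_ => 1 } @words1;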

Is it possible to skip the temporary array and build the hash directly? Would that be any faster?

sunzhiguolu

#2, posted 2016-08-04 18:34

perl -anle '{$h{$_}++ for @F}END{$,=",";print keys %h}' f
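
For readers less used to the switches: -n wraps the code in a read loop, -l strips the line ending, and -a splits each line on whitespace into @F. A rough script-form sketch of the same idea follows. The helper name count_words and the in-memory sample text are made up for illustration, and the keys are sorted here only to make the output stable (the one-liner prints them in hash order).

```perl
use strict;
use warnings;

# Hypothetical helper: count each whitespace-separated word read from a filehandle.
sub count_words {
    my ($fh) = @_;
    my %h;
    while (my $line = <$fh>) {
        chomp $line;                  # roughly what -l does to each input line
        my @F = split ' ', $line;     # roughly what -a does
        $h{$_}++ for @F;              # the body of the one-liner
    }
    return \%h;
}

# Demo on an in-memory filehandle so the sketch runs without an input file.
my $sample = "The U.N. Food and Agriculture Organization says the food\nis food\n";
open my $fh, '<', \$sample or die "cannot open in-memory file: $!";
my $counts = count_words($fh);
close $fh;

print join(',', sort keys %$counts), "\n";
```

Because the hash is filled with ++, each value ends up as the number of occurrences rather than a plain 1, which is harmless if only the keys are needed.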

104359176

#3, posted 2016-08-04 22:06

Localizing $/ is just a convenient way to slurp the whole text into one string; it has nothing to do with speed.

If you want it to run faster, use an array and uniq it.
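
One possible reading of "use an array and uniq it" is the common grep-with-a-seen-hash idiom, sketched below. This is an interpretation, not necessarily what the poster had in mind; the helper name uniq_words and the sample string are invented for the example. Note that it still relies on a hash internally.

```perl
use strict;
use warnings;

# Hypothetical helper: return the distinct words of a string, in first-seen order.
sub uniq_words {
    my ($text) = @_;
    my %seen;
    # !$seen{$_}++ is true only the first time a given word is encountered
    return grep { !$seen{$_}++ } split ' ', $text;
}

my @words = uniq_words("to be or not to be");
print "@words\n";    # prints "to be or not"
```

Whether this beats building the hash directly would have to be measured on real input, for example with the core Benchmark module.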

jason680

#4, posted 2016-08-04 23:20

Last edited by jason680 on 2016-08-05 11:02

Reply to #1 大山里出來的孩子

$ perl words.pl text_in
the half ensure of needs has Sudan. Food Agriculture to funding less in help says Organization it South than U.N. food parts security and The

$ cat words.pl
use strict;
use warnings;

my %hWord;

while(<>){
    chomp;
    $hWord{$_}=1 for(split);
}
print join(" ",keys %hWord),"\n";
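
If the text has already been slurped into a single string, as in post #1, a global match in scalar context is one way to fill the hash without an intermediate array. This is a minimal sketch; the name words_from_string and the sample string are assumptions made for the example, and it makes no claim about being faster than the line-by-line loop above.

```perl
use strict;
use warnings;

# Hypothetical helper: collect the distinct words of an already-slurped string.
sub words_from_string {
    my ($content) = @_;
    my %words;
    # In scalar context, /g resumes after the previous match on every
    # iteration, so the full list of words is never materialized.
    while ($content =~ /(\S+)/g) {
        $words{$1} = 1;
    }
    return \%words;
}

my $words = words_from_string("The U.N. Food\nand  Agriculture\n");
print scalar(keys %$words), " distinct words\n";    # prints "5 distinct words"
```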