The difference between know and know about, based on the COCA corpus

know -about / know about + *

Searching this way, COCA returns no results (the query takes too long).

So I can only work from the n-grams.

 

Based on 3-grams, though the results are a bit sparse.

n't know about, 3249
to know about, 3038
know about the, 1731
you know about, 1002
know about it, 931
we know about, 865
know about that, 759
I know about, 721
know about this, 565
know about you, 385
not know about, 371
should know about, 292
they know about, 278
even know about, 194
know about him, 180
people know about, 170
know about your, 167
know about them, 166
know about his, 139
You know about, 137
know about what, 134
know about my, 131
nt know about, 131
know about a, 124
know about me, 124
he know about, 122
really know about, 118
know about these, 116
do know about, 113
all know about, 103
know about their, 100
already know about, 90
We know about, 88
know about her, 79
who know about, 75
know about all, 73
would know about, 70
to Know About, 69
know about any, 63
she know about, 63
They know about, 58
know about our, 54
now know about, 49
know about how, 48
will know about, 48
never know about, 47
must know about, 46
know about us, 45
know about those, 42
did know about, 41
know about and, 36
may know about, 29
n't Know About, 28
know about life, 27
them know about, 26
know about sex, 25
us know about, 25
might know about, 24

 

 

Got it: last time I did the counting with AntConc.

about: 58 *
you: 3
n't: 2
to: 2
we: 2
they: 2
them: 2 *
all: 2 *
us: 2 *
the: 1
it: 1
that: 1
i: 1
this: 1
not: 1
should: 1 ***
even: 1 *
him: 1 *
people: 1 *
your: 1 *
his: 1 *
what: 1
my: 1 *
nt: 1 ***
a: 1
me: 1 *
he: 1
really: 1 ***
these: 1 *
do: 1
their: 1 *
already: 1 ***
her: 1 *
who: 1 *
would: 1 *
any: 1 ***
she: 1 *
our: 1 *
now: 1 *
how: 1 *
will: 1 *
never: 1 ***
must: 1 ***
those: 1 ***
did: 1 *
and: 1
may: 1 ***
life: 1 ***
sex: 1 ***
might: 1 ***
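The count of 58 for "about" matches the number of 3-gram lines above, so the tally seems to count each line once and ignore the corpus frequency after the comma. A minimal sketch of that kind of count, using a handful of the 3-gram lines as sample input (`neighbour_counts` is a hypothetical helper name, and whitespace tokenisation is an assumption; AntConc's own token definition may differ):

```python
from collections import Counter

# A few lines copied from the 3-gram list, in "3-gram, frequency" form.
trigrams = """\
n't know about, 3249
to know about, 3038
know about the, 1731
you know about, 1002
know about you, 385
You know about, 137
n't Know About, 28
"""

def neighbour_counts(raw):
    """Count how many 3-gram lines each (lowercased) word appears in."""
    counts = Counter()
    for line in raw.splitlines():
        if not line.strip():
            continue
        # Split off the corpus frequency after the last comma and ignore it.
        gram, _, _freq = line.rpartition(",")
        for word in gram.lower().split():
            counts[word] += 1
    return counts

counts = neighbour_counts(trigrams)
for word, n in counts.most_common():
    print(f"{word}: {n}")
```

Lowercasing first is what merges "you" and "You" into a single entry, as in the list above.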

 

 

 

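OK, so it turns out know about has no particularly distinctive collocates. (Actually, that was because I used the pattern .+know about.+ when I should have used .*, so some entries were not removed.)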
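The difference between the two patterns is easy to check. Assuming the pattern is applied to 3-gram lines like the ones listed earlier, `.+` needs at least one character before "know about", so any line that starts with "know about" slips through. A small sketch with Python's `re` module (the editor actually used may behave a little differently):

```python
import re

# Sample lines copied from the 3-gram list.
lines = [
    "n't know about, 3249",
    "know about the, 1731",
    "you know about, 1002",
    "know about it, 931",
]

strict = re.compile(r".+know about.+")  # needs text on both sides
loose = re.compile(r".*know about.*")   # text on either side is optional

# Lines the loose pattern catches but the strict one misses.
missed = [ln for ln in lines if loose.search(ln) and not strict.search(ln)]
print(missed)
```

Here the two lines that begin with "know about" are the ones the stricter pattern leaves behind.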

1 58 @@about@@
2 58 @@know@@
3 3 @@you@@
4 2 @@all@@
5 2 @@n@@
6 2 @@t@@
7 2 @@them@@
8 2 @@they@@
9 2 @@to@@
10 2 @@us@@
11 2 @@we@@
12 1 @@a@@
13 1 @@already@@
14 1 @@and@@
15 1 @@any@@
16 1 @@did@@
17 1 @@do@@
18 1 @@even@@
19 1 @@he@@
20 1 @@her@@
21 1 @@him@@
22 1 @@his@@
23 1 @@how@@
24 1 @@i@@
25 1 @@it@@
26 1 @@life@@
27 1 @@may@@
28 1 @@me@@
29 1 @@might@@
30 1 @@must@@
31 1 @@my@@
32 1 @@never@@
33 1 @@not@@
34 1 @@now@@
35 1 @@nt@@
36 1 @@our@@
37 1 @@people@@
38 1 @@really@@
39 1 @@sex@@
40 1 @@she@@
41 1 @@should@@
42 1 @@that@@
43 1 @@the@@
44 1 @@their@@
45 1 @@these@@
46 1 @@this@@
47 1 @@those@@
48 1 @@what@@
49 1 @@who@@
50 1 @@will@@
51 1 @@would@@
52 1 @@your@@

 

 

The correct result: well, there is still only one, sex.

1 58 @@about@@
2 58 @@know@@
3 3 @@you@@
4 2 @@all@@
5 2 @@n@@
6 2 @@t@@
7 2 @@them@@
8 2 @@they@@
9 2 @@to@@
10 2 @@us@@
11 2 @@we@@
12 1 @@a@@
13 1 @@already@@
14 1 @@and@@
15 1 @@any@@
16 1 @@did@@
17 1 @@do@@
18 1 @@even@@
19 1 @@he@@
20 1 @@her@@
21 1 @@him@@
22 1 @@his@@
23 1 @@how@@
24 1 @@i@@
25 1 @@it@@
26 1 @@life@@
27 1 @@may@@
28 1 @@me@@
29 1 @@might@@
30 1 @@must@@
31 1 @@my@@
32 1 @@never@@
33 1 @@not@@
34 1 @@now@@
35 1 @@nt@@
36 1 @@our@@
37 1 @@people@@
38 1 @@really@@
39 1 sex
40 1 @@she@@
41 1 @@should@@
42 1 @@that@@
43 1 @@the@@
44 1 @@their@@
45 1 @@these@@
46 1 @@this@@
47 1 @@those@@
48 1 @@what@@
49 1 @@who@@
50 1 @@will@@
51 1 @@would@@
52 1 @@your@@

 

 

Hint

The word know occurs 2,112,089 times in the corpus.


In COCA, you can usually find the collocates for high-frequency words like this, as long as:


1) you search by "lemma" (capitalize the "dictionary form" of the word(s), e.g. DECIDE instead of decides)
2) you leave the "span" set to 4 words left and 4 words right, and
3) you don't limit by section (e.g. by time period, genre, or dialect, depending on the corpus).
4) you don't use Virtual Corpora
5) you don't choose to see the frequency by section (i.e. you need to de-select the box to the left of "Sections" in the search form).


You might also re-do the search with one of the following:


1. Reverse the WORD and COLLOCATES fields, by putting the least frequent word in the [WORD] field, or
2. Do the search as a string search, e.g. ADJ health, instead of health + ADJ collocates, or VERB the money instead of money + NOUN collocates.
 


Most importantly, there is a MUCH better way to get lots of collocates for high frequency words, than searching for them one by one in the online corpus. WWW.COLLOCATES.INFO allows you to download millions of node / collocate pairs from either COCA (13.5 million node/collocate pairs) or iWeb (33 million node/collocate pairs). It will probably be much better for you, and it will definitely decrease the load on the corpus server as well.
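To make the "span" setting in the hint concrete, here is a rough sketch of counting collocates within 4 words to the left and 4 to the right of a node word. The sentence is made up for illustration, `collocates` is a hypothetical helper, and this is not how COCA computes its results internally:

```python
from collections import Counter

def collocates(tokens, node, left=4, right=4):
    """Count words occurring within `left`/`right` tokens of each `node` hit."""
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok != node:
            continue
        start = max(0, i - left)
        # Words before the node plus words after it, excluding the node itself.
        window = tokens[start:i] + tokens[i + 1:i + 1 + right]
        counts.update(window)
    return counts

# Invented example sentence, already tokenised by whitespace.
tokens = "i do n't know about that but you know what i mean".split()
counts = collocates(tokens, "know")
print(counts.most_common())
```

A wider span pulls in more words per hit, which is presumably part of why very frequent node words like know are expensive to query.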

 

Another approach

 

 

know -about

@@you@@
@@i@@
@@n't@@
@@do@@
@@we@@
@@did@@
@@want@@
@@let@@
@@does@@
@@even@@
@@really@@
@@need@@
never
yeah
@@wanted@@
@@already@@
@@wants@@
@@wan@@
@@nt@@
@@letting@@
@@t@@
l
@@anybody@@
don
ya
um
@@ought@@
honestly
wanting
lets
curious
pretend
recipient
y'all
[sighs]
didn
ye
demanded
[chuckles]
comforting
[laughs]
-i
[?]
presume
[?
d'
ou
i-i-i
reassuring
[laughter]
cos
smartest
instinctively
inquiring
doesn
intuitively
cuz
[scoffs]
[chuckling]
y-you
[clears_throat]
niggas
[you]
gratifying
uneed
iike
yuu
profess
l-i
lemme
#i
bravest
b/c
you-all
wouldn
i-i

 

know about

@@do@@
@@n't@@
@@you@@
@@i@@
what
@@we@@
@@did@@
@@want@@
they
@@need@@
how
people
everything
@@does@@
should
@@even@@
@@let@@
things
much
@@wanted@@
@@really@@
thing
@@already@@
else
tell
na
needs
@@wants@@
ever
anyone
needed
@@wan@@
information
everyone
hell
@@nt@@
@@letting@@
given
parents
supposed
americans
nobody
@@anybody@@
everybody
knowing
@@t@@
possibly
ones
@@ought@@
fuck
readers
taught
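In the two lists above, the @@ markers seem to flag words that show up in both lists, which leaves the unmarked words as the ones specific to each query. A minimal sketch of that comparison, using a few words copied from each list (`mark_shared` is a hypothetical helper name):

```python
def mark_shared(first, second):
    """Wrap words found in both lists in @@...@@; leave the rest unchanged."""
    shared = set(first) & set(second)

    def mark(words):
        return [f"@@{w}@@" if w in shared else w for w in words]

    return mark(first), mark(second)

# A few entries from each collocate list, in their original order.
know_not_about = ["you", "i", "n't", "do", "never", "yeah"]
know_about = ["do", "n't", "you", "i", "what", "they"]

marked_a, marked_b = mark_shared(know_not_about, know_about)
print(marked_a)
print(marked_b)
```

Keeping the lists in their original rank order, rather than turning them into sets, makes it easy to see how far down each list the shared words stop.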

 

 

 

right 4

know -about

1  i
2  what
3  how
4  if
5  're
6  'm
7  where
8  why
9  going
10  've
11  mean
12  anything
13  better
14  maybe
15  doing
16  exactly
17  whether
18  happened
19  talking
20  means
21  answer
22  truth
23  sounds
24  happens
25  feels
26  anybody
27  happening
28  funny
29  ai
30  personally
31  loves
32  existed
33  exact
34  drill
35  sayin
36  hurts
37  trivia
38  kinda
39  react
40  certainty
41  @@y'all@@
42  nothin
43  firsthand
44  talkin
45  hates
46  fucked
47  [?]
48  goin
49  sucks
50  doin
51  intimately
52  whereabouts
53  specifics
54  basics
55  somethin
56  whats
57  messed
58  bothers
59  thinkin
60  messing
61  beforehand
62  thyself
63  squat
64  pronounce
65  ins
66  scares
67  cpr
68  corny
69  instinctively
70  outs
71  first-hand
72  karate
73  shortcut
74  intuitively
75  cliche
76  caged
77  i-i-i
78  clap
79  iike
80  particulars
81  whati
82  thati
83  pisses
84  whereof
85  nothings
86  motivates
87  whence
88  cliché
89  redeemer
90  goofs
91  lingo
92  offhand
93  you're
94  fleischman
95  niggas
96  whatyou
97  whyi
98  so-and-so
99  ifyou
100  missin

 

know about

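(After some checking: the words below look as if they are absent from the know -about list above, but their actual frequencies may be lower than those for know -about, because know -about is so frequent that COCA keeps throwing errors.)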

1  ?
2  this
3  until
4  guys
5  yet
6  guy
7  stuff
8  sex
9  rest
10  situation
11  anyway
12  politics
13  relationship
14  condition
15  disease
16  universe
17  topic
18  secret
19  huh
20  climate
21  murder
22  affair
23  gretna
24  bellevue
25  ralston
26  papillion
27  virus
28  guns
29  risks
30  operation
31  islam
32  computers
33  biology
34  evolution
35  babies
36  childhood
37  incident
38  accident
39  dangers
40  @@y'all@@
41  pregnancy

posted @ 2024-12-07 15:45  hrdom