|
Frequently Asked Questions
Can I use WordSmith 3, 4 and 5 at the same time?
Yes, you should be able to. They do not share folders or files with
each other. (WordSmith versions 1-4 are not being developed any further.)
What are the differences
between WordSmith 3 and 4?
In theory, anything WS3 can do WS4 can do better. WS4 has far fewer
limitations, such as the number of concordance lines, etc. There are
not supposed to be any functions which you can do in WS3 which are not
in WS4 too. There are a whole lot of new functions and tools in WS4
too -- click here
for full Online Help and look at Overview | What's new in Version
4.
It crashed -- I sent Microsoft
a message but they cannot solve the problem.
Not your fault! Microsoft won't know how to solve the problem. If
you cannot get into WS4 at all, then there is a chance your
PC is not powerful enough or you are not running an up-to-date operating
system. If you can get in enough to see at least the main WordSmith
Tools screen (top left corner of your screen) and then you get a crash
after that, on quitting you should also get a chance to send me an email
(if you want), which will let me know details of your PC and where the
program ran into problems. You may find a fresh download solves the
problem. WordSmith Tools is often updated -- thanks to helpful feedback
from the user community!
When my temporary
registration expires
If you have been given a free temporary use registration, you may wonder
what happens when it expires. WordSmith will simply revert to demo mode.
Any concordances, wordlists etc you do will be complete as before but
you just won't be able to see or print or save as
text all the findings. You should visit the OUP website and purchase
a registration code. OUP are hoping to be able to have online ordering
so if you have a credit card you should be able to buy the code and
receive it by email within minutes.
Can WordSmith handle
Language X (eg. Igbo)?
See also How WordSmith handles various languages other than English.
There are 3 issues here:
a) processing a string of text and for example finding word breaks,
or seeking concordance lines
b) showing results on a screen and in printed form
c) sorting lists of words in alphabetical order.
processing a string of text
In a pc, each symbol is only a number such as 5,793 or 65, so in principle
any writing system can be accommodated. In WordSmith, recognising word-breaks
is done by assuming there is a space or some such between words. This
does not apply to all languages: in particular, Chinese and Japanese
do not usually mark word-breaks as English does using spaces (code 32).
That means that concordancing should work OK (since we are simply searching
for a string of characters) but word-listing is not possible (though
character-listing is).
showing results
You need a font which translates the numbers into symbols in pixels.
In WordSmith 3, a user can define their own alphabet and own alphabetical
order but there are problems in getting a Windows pc to show things
correctly.
WS3 uses alphabets which can be represented in one byte (a number between
0 and 255), a system which was usual in computers until recently. In
practice with such a one-byte representation system there needs to be
a "codepage" (a table of 256 characters) which contains the
symbols you need. I do not believe there is one suitable for Igbo but
I am not sure: presumably computational linguists in Nigeria will know.
The Windows Eastern Europe codepage covers some but probably not all.
WS4 can use the old one-byte system still, but also 2-byte Unicode.
This contains a very large number of symbols indeed, many thousands
of them. For Unicode you will need a Unicode font. Check "Unicode"
in a search-engine if this is new to you.
Here is a detailed example of
how I made a word list in Igbo.
sorting words
For this, you need a principle for saying whether (for example) an
Igbo sub-dotted o is to belong with the other o's or whether it should
come after z. WS4 now uses the sorting routines provided by Microsoft
instead of my own. The user has to specify which language they are using
and Microsoft's sort routines for English, Portuguese, Japanese, French,
Russian etc do the sorting. I do not think Igbo is included in the list,
but this would only affect the sorting and might not be important in
practice. The best thing to do is to tell WS4 that the language you
are processing is one which does sort acceptably for your data: this
might be Hungarian or Spanish, you will have to experiment to see.
How do I type in
accented characters or symbols if I don't have the right keyboard?
Chinedu Uchechukwu (<neduchi@netscape.net> who works with Igbo
and is based in Germany) tells me "The temporary solution I am
presently using is "Keyman Keyboard" program, which simply
changes the special characters on the German keyboard to Igbo characters."
There may be other solutions available.
Does WordSmith tag texts?
No. You have to tag your own manually or use a tagger to tag them automatically.
Getting text -- Does
WordSmith come with a corpus?
Most corpora are not free for me to distribute. WordSmith 4 comes with Webgetter (see Utilities | Webgetter) which helps download text from the Internet. There is also a link here to the LUCY corpus of texts by kids and students, courtesy of Geoff Sampson (Sussex). Alternatively you can easily
build up your own using Internet resources. There are lots of other corpora,
some of which are freely accessible, others can be purchased cheaply,
and others are extremely expensive. Try a Google search on "text
corpus". Or visit newspaper web sites. Or try the Online
Books page. The British National
Corpus (World Edition) is £50 for a single-user licence, cheap
at the price.
How do I make a custom DLL?
Here's an example, written
in Pascal.
My registration code
didn't work...
Paste in the details you were given. They must be a perfect match!
Supposing you were given this:
Registered to: Maria Antonia Silveira
Other details:
Registration code: SA40.9273.7215.4681.4554
then you'd fill it in like this, pasting from the email to the updating program:
The beta version of WS5 uses the name "temporary use" (not
your own name or "temporary user") and a code supplied in
the readme.txt file. Run updater.exe after installation to register.
For WS3 registration problems contact Oxford University Press.
How do I get WS4 to work on
a network?
- Install ONE copy only, on whichever PC is used for networked programs.
That is, download to a temporary drive as with a stand-alone installation
on any PC, then extract to your network server eg. n:\wsmith4.
- Run n:\wsmith4\wordsmith.exe.
- Paste in the registration code as you would with a stand-alone version.
- Don't forget to check the "For use of network?" box. Leave
the others unchecked (assuming you don't want to associate WordSmith
files for yourself, or to have an icon on your server desktop). Press
OK.
- You will see a message telling you the code is OK (if it is!) and
can choose OK or Help -- click Help before you press OK. If you miss
it, open the main help and look in the installation section under "network
defaults". The rest is explained there. You can click here
to see the Online version.
- If you lose your wordsmith.ini or find the comments have disappeared from it, here is another copy.
Is there a guide for use of WordSmith?
There's a Windows Help file and an Adobe Acrobat wordsmith.pdf manual in your \wsmith4 files and Online Help on this site (all essentially the same).
If you press F1 or right-click and see "What's this?" in the
program you get Windows Help, if you prefer you can print out the .pdf, or you can view help online here.
There is also a get-you-started
online-only guide. There is the manual
(.pdf) too.
Collocate statistics
-- what is available?
For the moment the statistics are Mutual Information, MI3, Log Likelihood, Z Score. You now get a chance to choose between various
appropriate statistical functions.
How
is WordSmith designed?
It is developed using the latest version available of Windows (currently
XP Service Pack 2). The programs are written in the Delphi environment,
essentially a version of Pascal, currently Delphi 2007. The fast
bits are in Assembler. Apart from some purchased components (eg. a tool
for zipping) it is a one-man effort.
Network
Performance issues
SOME TIMINGS
Starting WS4 standalone: 1 second?
Starting WS4 networked on home wireless network, 1st time: approx
15 secs.
Starting WS4 networked on home wireless network, subsequently: approx
13 secs.
WS4 networked making a word list of 480 specific texts on local drive:
15 secs.
WS4 networked making a word list of 480 specific texts on network
drive: 46 secs.
Network performance will depend on the efficiency of your network equipment
and network protocols and the amount of traffic on your servers at the
time, factors outside WS4's control, of course. As far as WS4 is concerned,
the times when the network is most heavily used are
a) starting up WS4
b) accessing any network as opposed to local hard drives on the individual
PC.
The network drive where the many component files of the original program
are stored is one factor in this. Another is the drive which users may
read & write to: this may be a network drive or a local hard drive,
depending on wordsmith.ini as concerns this section of it:
[NETWORK]
;network-read/write folder=
;remove semi-colon in line above if running on network
;and change directory to one that users can write to
;prohibited drives=xyz
;limited folder=
; this one only allows the user to see that directory
; & any of its sub-dirs on that drive + speeds processing
Let's imagine a site licence situation where the original programs
are at V:\WordSmith4. There is a copy of wordsmith.ini at V:\WordSmith4\wordsmith.ini.
Let's imagine this says
[NETWORK]
network-read/write folder=M:\Wsmith4
and that drive M: is a network drive, not local to any PC to
be used by WS4 users. Then there will be network traffic whenever users
want to save results, access saved results or personal settings on drive
M:\Wsmith4.
If it says
[NETWORK]
network-read/write folder=C:\Wsmith4
and C: is a local drive on each user's PC there will NOT be network
traffic whenever users want to save results, access saved results or
personal settings on drive C:\Wsmith4.
Network administrators may or may not allow writing to C:\ of course.
At Liverpool, they allow writing to C:\temp but nowhere else, I think.
STARTING UP
I don't think we can do anything about start-up because the operating
system needs to access the program at V:\Wsmith4\. If the traffic is
jammed it will take time. Standalone (non-networked) startup is around
3 seconds on my PC. Networked startup is bound to require all the resources
of wordsmith.exe to be brought over to the networked PC. The same applies
to starting up any individual Tool such as Concord. If a set of 50 users
double-click the program icon simultaneously they'll be in a queue.
At our installation in Liverpool it does take quite a time, maybe 15-20
seconds on one PC (I haven't yet timed it but that's my impression).
I have not experienced frustration in Liverpool with this.
AFTER START-UP
Drives get accessed when the user saves or opens any results files (by
definition) and whenever WS4 needs to know some setting or other from
the drive directory where their settings are stored. In Liverpool I
have experienced frustration here and got my students to save at C:\temp with MUCH faster performance.
Which Reference Corpus to use?
I think of it like this. If you compared an apple, let's say a Fuji apple, to a whole lot of apples of various colours and sizes, you would find out the specific nature of the variety of your apple (e.g. a Fuji might be sweeter or redder or bigger than most) but wouldn't find out the "apple-ness" -- for example the fact your apple has a thin skin and small pips would not be at all outstanding, because they all have thin skins and small pips. If you compared the same apple with a mixture of various fruits, its thin skin would be more likely to be outstanding if the mix of fruits contained oranges, bananas, pineapples etc, even if it also contained a few other apples too. What you'd get would be the "apple-ness" in your comparison. You might lose some of the "Fuji-ness". Finally, if you compared the apple to a whole lot of things in your kitchen including other fruit, knives, chairs, cloves of garlic, olive oil, glasses and light bulbs, etc, you would get the "fruit-ness" as well: e.g. the fact that it is relatively solid and doesn't break if dropped but gets bruised. You'd presumably lose some of the "apple-ness" and "Fuji-ness".
Showing Line numbers in a Concordance

To get results like this where we have a Shakespeare sonnet with line numbers, first ensure all line numbers are marked like this:
<1> From fairest creatures we desire increase,
<2> That thereby beauty's rose might never die,
<3> But as the riper should by time decease,
<4> His tender heir might bear his memory:
Then create a tag file containing just this one line:
<#>
Now in Adjust Settings | Tags, load the tag file. Generate your concordance and you should see the line numbers in the Tag column.
Accessing Help with a network installation
Microsoft have moved the goalposts. If you find your users cannot access the WordSmith help across the network, there is a solution here.
"Problem making wordlist" error message (or concordance)
Some time in July or August 2006 there was for a few days a problem of this kind. It comes when WS4 fails to find text files which you had chosen in the file choosing part of the Controller. A workaround would be to import a list of the text files which you have made using Notepad. (To do that, press the Get Favourites button, change the file type at the bottom from "favourite text files (*.dat)" to "self-edited Text file (*.txt)". Then once the list has been successfully read in, save it by pressing the red Save Favourites button to avoid having to do that again.... That way you avoid any problems that might arise in the file chooser which has to be able to get files from the left side to the right side of itself correctly to work properly.
|