NLA Trial Articles from 1955

  1. Accuracy of OCR and overProof is measured against the human corrections. We know the human corrections in this sample are incomplete and themselves contain errors, but they are the best we could find automatically in the NLA newspapers corpus: articles tagged as completely corrected, then further filtered to those with at least 3 corrections, at least 40% of lines corrected, and a percentage of non-dictionary words in the lowest third.
  2. Accuracy is measured by a separate process from that used to colour words in this output: the colouring process is heuristic, and not completely accurate.
  3. Colour legend:
    Text - OCR text corrected by human and/or overProof
    Text - human and/or overProof corrections
    Text - discrepancies between human and/or overProof
    Text - human corrections not applied by overProof
  4. Identified overProof corrections are calculated by the statistical calculation process and show those words changed by overProof which ALSO match human corrections. As human corrections are often wrong and incomplete, so too is this list.
  5. Identified overProof non-corrections are calculated by the statistical calculation process and show those words in the overProof output which DO NOT MATCH human corrections. As human corrections are often wrong and incomplete, so too is this list. Words marked as [**VANDALISED] are those which have been changed by overProof but not by the human correction; as before, a missed human correction will be (incorrectly) classified as vandalisation by overProof.
  6. Searchability of unique words refers to the distinct words in an article, and how many are present before and after correction. It is a measure of how many of the words within an article could be used to find the article using a search engine.
  7. Weighted Words refers to a calculation in which common words count for little (a fraction of a word) and unusual words count for more, in proportion to the log of the inverse of their frequency in the corpus. It may be an indicator of how well distinctive words in an article can be searched before and after correction.
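
Notes 6 and 7 can be illustrated with a minimal Python sketch. It is not the process used to produce the figures in this report: the tokenisation, the case folding, the function names and the `corpus_freq` table of relative word frequencies are all assumptions made for illustration.

```python
import math
import re


def distinct_words(text):
    """Lower-cased distinct alphabetic tokens (this tokenisation is an assumption)."""
    return set(re.findall(r"[a-z]+", text.lower()))


def unique_word_searchability(candidate, reference):
    """Percentage of the reference's distinct words that also occur in candidate."""
    ref = distinct_words(reference)
    if not ref:
        return 100.0
    found = ref & distinct_words(candidate)
    return 100.0 * len(found) / len(ref)


def weighted_word_score(candidate, reference, corpus_freq, default_freq=1e-6):
    """Like unique_word_searchability, but each word counts log(1 / frequency).

    corpus_freq maps a word to its relative frequency in the corpus
    (hypothetical values here); unseen words fall back to default_freq.
    """
    def weight(word):
        return math.log(1.0 / corpus_freq.get(word, default_freq))

    ref = distinct_words(reference)
    total = sum(weight(w) for w in ref)
    if total == 0:
        return 100.0
    hit = sum(weight(w) for w in ref & distinct_words(candidate))
    return 100.0 * hit / total


reference = "he could not swim"   # human-corrected line
raw_ocr = "he could not awlm"     # raw OCR line
freq = {"he": 0.01, "could": 0.005, "not": 0.01, "swim": 0.0001}  # made-up frequencies

print(unique_word_searchability(raw_ocr, reference))  # 75.0: 3 of 4 distinct words found
print(weighted_word_score(raw_ocr, reference, freq))  # lower, since the rare word is the one lost
```

Because "swim" is much rarer than the other three words, losing it costs more under the weighted measure than under the plain count of distinct words.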

Article ID 62535749, Article, Six N.S.W. Fatalities Over The Week-end, page 1 1955-01-03, Townsville Daily Bulletin (Qld. : 1907 - 1954), 166 words, 3 corrections

Raw OCR | Human Corrected | overProof Corrected
Sri N.S.W. Fatalties | Six N.S.W. Fatalities | Sir N.S.W. Fatalities
Orer The Week-end | Over The Week-end | Over The Week-end
*aZI-NET, January 2.— A | SYDNEY, January 2.— A | ZERNER, January 22 A
total of four people were kill | total of four people were kill- | total of four people were killed
ed In road accidents, and two | ed in road accidents, and two | in road accidents, and two
drowned-ln New South Wales | drowned in New South Wales | drowned in New South Wales
over the week-end. | over the week-end. | over the week-end.
. Both drowning* occurred | Both drownings occurred | Both drownings occurred
to-day and the road accidents | to-day and the road accidents | to-day and the road accidents
on Saturday. | on Saturday. | on Saturday.
Arthur Henry Flsnuery | Arthur Henry Flannery | Arthur Henry Flannery
(34), of Homebush, Sydney, | (34), of Homebush, Sydney, | (34), of Homebush, Sydney,
was drowned when his canoe | was drowned when his canoe | was drowned when his canoe
wa» swamped in heavy sea* | was swamped in heavy seas | was swamped in heavy seas
at Palm Beach, Sydney. | at Palm Beach, Sydney. | at Palm Beach, Sydney.
Phillip David Cuthbert (12), | Phillip David Cuthbert (12), | Phillip David Cuthbert (12),
«f Smlthfleld, Sydney, was | of Smithfleld, Sydney, was | of Smithfield, Sydney, was
drowned when be slipped | drowned when he slipped | drowned when he slipped
from a rubber float at Bob | from a rubber float at Bob- | from a rubber float at Bob
Vm Head, near Sydney. He | bie Head, near Sydney. He | Vm Head, near Sydney. He
could not awlm. | could not swim. | could not swim.
The road death* were:— | The road deaths were :— | The road deaths were
Mary Summer* (B8), of | Mary Summers (58), of | Mary Summers (98), of
Newcastle, who Was hit by a | Newcastle, who was hit by a | Newcastle, who was hit by a
truck. | truck. | truck.
Anne Lynette McGllvray | Anne Lynette McGilvray | Anne Lynette McGilvray
-8), of Nambucca Head*, | (8), of Nambucca Heads, | (8), of Nambucca Heads,
Mew South Wales, was In a | New South Wales, was in a | New South Wales, was in a
car which ran off the road | car which ran off the road | car which ran off the road
and hit a telegraph pole. | and hit a telegraph pole. | and hit a telegraph pole.
Athol Henry, Hutchison | Athol Henry Hutchison | Athol Henry, Hutchison
-M-, of Mlrtagong, New | (28), of Mittagong, New | -M-, of Mittagong, New
Wales, was thrown from the | Wales, was thrown from the | Wales, was thrown from the
utility he was driving when | utility he was driving when | utility he was driving when
it skidded in loose gravel. | it skidded in loose gravel. | it skidded in loose gravel.
June Helen -Burgess Foy | June Helen Burgess Foy | June Helen Burgess Foy
-8Sfciof Maltland, New South | (25), of Maitland, New South | -Stief Maitland, New South
Wales, wa* in a car which | Wales, was in a car which | Wales, was in a car which
collided with another car. | collided with another car. | collided with another car.
Identified overProof non-corrections: BOBBIE SIX SMITHFLELD
Measure | Count | OCR accuracy % | overProof accuracy % | corrected %
All Words | 150 | 84.7 | 96.7 | 78.3
Searchability of unique words | 96 | 85.4 | 96.9 | 78.6
Weighted Words | | 86.6 | 97.1 | 78.6
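
The corrected % figures above appear consistent with the share of remaining error removed, that is, the gain in accuracy divided by the gap between the first accuracy figure and 100%. This reading is inferred from the numbers, not stated in the notes, so the Python sketch below (with a hypothetical function name) should be taken as a plausibility check only.

```python
def errors_corrected_pct(before_pct, after_pct):
    """Share of the error present before correction that is gone afterwards."""
    remaining = 100.0 - before_pct
    if remaining <= 0:
        return 0.0  # nothing left to correct
    return 100.0 * (after_pct - before_pct) / remaining


# (accuracy before, accuracy after, reported corrected %) for article 62535749
rows = {
    "All Words": (84.7, 96.7, 78.3),
    "Searchability of unique words": (85.4, 96.9, 78.6),
    "Weighted Words": (86.6, 97.1, 78.6),
}

for name, (before, after, reported) in rows.items():
    estimate = errors_corrected_pct(before, after)
    print(f"{name}: estimated {estimate:.1f}, reported {reported}")
```

The estimates land within about 0.3 of a percentage point of the reported values; the small differences are what one would expect if the report computes corrected % from unrounded accuracies.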

Article ID 79255893, Article, DIESELS IN AUSTRALIA, page 29 1955-10-27, The Central Queensland Herald (Rockhampton, Qld. : 1930 - 1956), 203 words, 3 corrections

Raw OCR | Human Corrected | overProof Corrected
All Australian States now | All Australian States now | All Australian States now
have programmes to replace | have programmes to replace | have programmes to replace
their steam locomotives with | their steam locomotives with | their steam locomotives with
dlesel-electrlcs. Hits has re | diesel-electrics. This has re- | diesel-electric. Hits has resulted
sulted from the success with | sulted from the success with | from the success with
which dlesel-electrlcs have | which diesel-electrics have | which diesel-electric have
been operated in Australia, | been operated in Australia. | been operated in Australia,
Railway authorities are find | Railway authorities are find- | Railway authorities are finding
ing dlesel-electrlcs the most | ing diesel-electrics the most | diesel-electric the most
effective answer to serious | effective answer to serious | effective answer to serious
competition from air trans | competition from air trans- | competition from air transport.
port. | port. |
Hie great amount of trav | The great amount of trav- | The great amount of travelling
elling time saved by replacing | elling time saved by replacing | time saved by replacing
steam engine* with dlesel | steam engines with diesel- | steam engines with diesel
electrlcs la a vital factor in | electrics is a vital factor in | electrics is a vital factor in
this country of great distan | this country of great distan- | this country of great distances.
ces. | ces. |
For instance the travelling | For instance the travelling | For instance the travelling
time on the run from Perth | time on the run from Perth | time on the run from Perth
to Melbourne has been cut | to Melbourne has been cut | to Melbourne has been cut
from 841 hours to 501 hours | from 841 hours to 501 hours | from 841 hours to 501 hours
since dlesel-electrlcs went on | since diesel-electrics went on | since diesel-electric went on
the line. They have also been | the line. They have also been | the line. They have also been
responsible for dicing live | responsible for slicing five | responsible for dining five
hours off the trip from Sydney | hours off the trip from Sydney | hours off the trip from Sydney
to Brisbane. | to Brisbane. | to Brisbane.
At the "tage when dlenel | At the stage when diesel | At the "stage when diesel
electrics made up only & per | electrics made up only 5 per | electrics made up only a per
cent, of the locomotives in | cent. of the locomotives in | cent, of the locomotives in
Australia, they covered 12 per | Australia, they covered 12 per | Australia, they covered 12 per
cent of the total mileage re | cent. of the total mileage re- | cent of the total mileage recorded
corded by all railways. | corded by all railways. | by all railways.
Diesel-electric locomotives | Diesel-electric locomotives | Diesel-electric locomotives
are also proving outstandingly | are also proving outstandingly | are also proving outstandingly
economical to run. | economical to run. | economical to run.
Commonwealth Railways | Commonwealth Railways | Commonwealth Railways
have reported that on the | have reported that on the | have reported that on the
trans-Australia line, fuel cost | trans-Australia line, fuel cost | trans-Australian line, fuel cost
per mile for dlesel-electrlcs | per mile for diesel-electrics | per mile for diesel-electric
is I6.664d. compared with | is 16.664d. compared with | is I6 664. compared with
64.841d. for team. Crew's | 64.841d. for steam. Crew's | 64 84. for team. Crew's
wages per mile - for dlesel | wages per mile for diesel | wages per mile - for diesel
electrics is 13.423d. compared | electrics is 13.423d. compared | electrics is 13.4 23d. compared
woth 23.72M.>for steam. The | with 23.729d. for steam. The | with 23.72M.>for steam. The
respective maintenance costs | respective maintenance costs | respective maintenance costs
are 6.02Bd. and 29.330d. a mile. | are 6.028d. and 29.330d. a mile. | are 6.02. and 29.30. a mile.
Identified overProof corrections: ENGINES STAGE FIVE
Identified overProof non-corrections: SLICING
Measure | Count | OCR accuracy % | overProof accuracy % | corrected %
All Words | 187 | 87.7 | 95.2 | 60.9
Searchability of unique words | 103 | 96.1 | 99.0 | 75.0
Weighted Words | | 96.8 | 99.2 | 75.0

Accumulated stats for 2 articles from year 1955

Measure | Count | OCR accuracy % | overProof accuracy % | corrected %
All Words | 337 | 86.4 | 95.9 | 69.7
Searchability of unique words | 199 | 90.9 | 98.0 | 77.8
Weighted Words | | 92.3 | 98.3 | 77.6
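
The accumulated accuracy figures for All Words and for Searchability of unique words appear to be count-weighted averages of the two per-article figures. This is an inference from the numbers, not a documented rule; the Python sketch below, with a hypothetical helper name, checks it against the values reported above. Weighted Words carries no count in this report, so it cannot be pooled this way.

```python
def pooled_pct(parts):
    """Count-weighted mean of (count, percentage) pairs."""
    total = sum(count for count, _ in parts)
    return sum(count * pct for count, pct in parts) / total


# (count, accuracy %) per article, taken from the two per-article tables
all_words_ocr = pooled_pct([(150, 84.7), (187, 87.7)])
all_words_overproof = pooled_pct([(150, 96.7), (187, 95.2)])
unique_ocr = pooled_pct([(96, 85.4), (103, 96.1)])
unique_overproof = pooled_pct([(96, 96.9), (103, 99.0)])

print(round(all_words_ocr, 1), round(all_words_overproof, 1))  # 86.4 95.9
print(round(unique_ocr, 1), round(unique_overproof, 1))        # 90.9 98.0
```

The rounded results match the accumulated rows, and the pooled counts (150 + 187 = 337 and 96 + 103 = 199) match the accumulated counts.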