|
Background
Dermoscopy (dermatoscopy, epiluminescence
microscopy, incident light microscopy, surface microscopy)
is widely used nowadays for the diagnosis of pigmented skin
lesions.
The method is getting more and more
popular, not only among dermatologists, but also among oncologists,
surgeons, pediatricians,
and even general physicians. However, there is still a need
for better standardization of the dermoscopic
terminology, which has not been reviewed since the last (and
only) Consensus Meeting on Dermoscopy
was held in Hamburg more than ten years ago, in November, 1989.
With a new Consensus Meeting,
we will attempt to refine the dermoscopic terminology and to
simplify the diagnostic categories
of pigmented skin lesions. Because dermoscopic images today
can be easily transmitted
via telecommunication devices, we plan to organize a virtual
Consensus Meeting via the Internet,
known as the Consensus Net Meeting on Dermoscopy (CNMD).
Goals
Main issue
To prove the validity of a two-step procedure
for the dermoscopic classification of pigmented skin lesions.
Additional issues
-
Consensus on terminology of dermoscopic criteria
by refinement of established criteria and definition
of new criteria
-
Diagnostic categories, as far as diagnosable
by dermoscopy
-
Consensus on management decisions based on
dermoscopic evaluation
-
Consensus on follow-up strategies, including
digital documentation
-
Consensus on validity of teledermoscopy and
elaboration of standard requirements for teledermoscopy
(DDDI-standard)
Work plan
The Preliminary scientific design, including
the Web design, was presented during an informal meeting in
San Francisco at the AAD meeting on March 10, 2000. The Materials
and Methods section has since been refined based on the comments
and suggestions of the participants. The actual CNMD will start
on July 1 and last until October 31, 2000.
First World Congress of Dermoscopy Rome, February 23-25, 2001
The results of the CNMD will be
discussed with Consensus Board Members at the World Congress
of Dermoscopy in Rome.
Materials and Methods
Materials
a) Cases
-
Pigmented skin lesions included in this
study are taken from the histopathology files of the Department
of Dermatology, University Federico II of Naples (Italy);
the Department of Dermatology, University of LAquila
(Italy); the Sydney Melanoma Unit, Royal Prince Alfred Hospital,
Camperdown (Australia); and the Skin and Cancer Associates,
Plantation, Florida (USA). Each case includes complete information
regarding relevant clinical* and histopathologic data as
well as clinical and dermoscopic images and histopathologic
specimens.
*Clinical data:
-
age, sex, skin phototype, total number
of nevi (0-10, 10-50, >50), personal and/or familial
history of melanoma;
-
location, diameter, and duration of the
lesion, as well as the history concerning any morphologic
changes within the last year before the excision of the
lesion.
b) Standards used for Digital Documentation
of Dermoscopic Images (DDDI-standard)
-
Clinical and dermoscopic images of each
case will be obtained using a Heine Dermaphot, with 10-fold
magnification of the lesion. The color slides will be converted
to digital format using a Kodak Photo CD system.
-
Using Adobe Photoshop software all Photo
CD images (RGB, 768 x 512 pixel, 72 DPI)
will be converted to jpeg format with high compression rate.
Each image will then be optimized for color, brightness,
and contrast by using the Adobe Photoshop standards (Auto
levels and Unsharp mask).
c) Criteria for Cases Selection
-
Before inclusion in the study all digital
images will be evaluated by a professional photographer
for focus, brightness and contrast. Only images with excellent
quality will be considered for inclusion.
-
Only those lesions having a clinical diameter
of less than 14 x 10 mm will be included. This is a basic
prerequisite in order to evaluate the entire surface of
the lesion.
-
Each case will then be included in one of
two groups of pigmented skin lesions:
1) Stereotypical examples (low level of diagnostic difficulty);
2) Dermoscopically equivocal examples (high level of diagnostic
difficulty).
-
All cases will be evaluated by 3 participants
for their level of diagnostic difficulty.
Each case will be assigned to group 1 or 2 by the agreement
of at least 2 out of 3 evaluators.
-
All cases will be reviewed by a panel of
histopathologists and only histopathologically
unequivocal lesions will be included in the study.
Methods
-
128 pigmented skin lesions will be randomly
subdivided in a training set of 20 cases and a test
set of 108 cases.
-
On the Internet a tutorial will be given
concerning the unifying concept of dermoscopy
(with complete definitions of criteria and example images).
-
By entering a password each participant will
be able to evaluate 20 cases from the training set.
For each case, the participants will complete an electronic
data sheet.
Immediately following submission of the completed electronic
data sheet, the participants
will receive the set of dermoscopic criteria of that case
as judged by the instructors
and the histopathologic diagnosis provided by the histopathology
panelists.
This "real-time feedback" should assist the participant
in mastering the dermoscopic criteria.
Each participant will then be able to evaluate on-line 108
cases of the test set.
A total of 4 months (July 1 to October 31, 2000) will be
given for evaluating the cases.
-
At the end of the evaluation period, participants
will re-examine 20 randomly selected cases
from the test set of 108 cases, to test for intra-observer
agreement.
-
The histopathologic diagnoses for all cases
will be posted on the web site after October 31, 2000,
following evaluation of all cases of the test set and evaluation
of the additional 20 cases
for determining intra-observer agreement.
-
The minimum required resolution for viewing
cases is 800 x 600 pixels and 24-bit color graphic capability.
-
At the beginning of the study all participants
will be asked to complete a questionnaire concerning their
personal experience in dermoscopy and their preferred diagnostic
methods, as well as their computer system specifications.
This information will be useful in understanding whether
the variability of the dermoscopic methods used in clinical
practice, the different learning curves and training experiences,
and the various computer systems used are affecting the
intra- and inter-observer agreement
on dermoscopic criteria and dermoscopic diagnoses.
Electronic Data Sheet for evaluation of cases
This will include:
-
Clinical information concerning lesion and
patient (see Materials section)
-
Melanocytic algorithm for differentiating
melanocytic and non-melanocytic pigmented skin lesions
-
Criteria for diagnosing melanocytic lesions
by pattern analysis
-
Criteria for the ABCD rule with automatic
calculation of the score
-
Criteria for the 7-point checklist with automatic
calculation of the score
-
Criteria for the Menzies' method with automatic
calculation of the score
-
Diagnostic categories for the final diagnosis
together with management recommendations and final remarks
Statistics
Rates of inter- and intra-observer
agreement will be analyzed by calculating the percentage of
agreement and determining the k index for agreement compared
with that expected by chance alone for each descriptor considered
[Fleiss LJ, 1981]. Subgroups of descriptors will be compared
for distribution of the associated k values by means of two-tailed
Wilcoxon rank-sum test. A k value of 1.0 indicates full agreement
beyond chance. Values greater than 0.75 are generally considered
excellent, values between 0.40 and 0.75 fair to good, and values
less then 0.40 poor [Fleiss LJ, 1981]. A p value <= 0.05
(two-sided) will be considered statistically significant.
Donner's method will be used
to calculate the sample size for testing two coefficients of
inter-observer agreement [Donner A, 1998; Walter et al, 1998].
The sample size requirement for comparing two coefficients of
inter-observer agreement will be calculated for dichotomous
and continuous outcome variables.
In order to measure intra-class
agreement between parameters, for each group we calculated two
different numbers of cases necessary to detect both a moderate
and a substantial strength of agreement. Choosing the level
of significance as alpha=0.05 and power (1-beta=0.80) for testing
H0: K1=K2 versus H1: K1 <> K2, the required sample size
for group testing would be 36 cases for moderate (k index of
agreement = 0.41-0.60) and 97 cases for substantial strength
of agreement (k= 0.61-0.80).
Additional statistical analysis
will be conducted to test the validity of procedures to detect
melanoma.
For this reason, cases will be divided in two groups (melanoma
and non-melanoma), based on results of the histopathologic assessment.
Using a univariate analysis, the relationship of each criterion
with the diagnosis of melanoma will be evaluated by chi-square
analysis. Logistic regression analysis will be used to test
the diagnostic impact of the whole set of dermoscopic criteria
in a multivariate approach, and the probability of each case
belonging to the group of melanomas will be calculated. Sensitivity,
specificity, efficacy and diagnostic accuracy will be assessed
for clinical diagnosis, dermoscopic diagnosis, and the diagnosis
provided by the logistic regression approach.
References
-
Donner A. Sample size requirement
for the comparison of two or more coefficients of inter-observer
agreement.
Statist Med 1998;17:1157-1168.
-
Walter SD, Eliasziw M, Donner A.
Sample size and optimal designs for reliability studies.
Statist Med 1998;17:101-110.
-
Fleiss LJ. Statistical methods for
rate and proportions. 2nd ed. New York, John Wiley &
Sons, 1981: 212-236.
|