Publication:
Interpretability of deep learning models: a survey of results

dc.contributor.author: Chakraborty, S.
dc.contributor.author: Tomsett, R.
dc.contributor.author: Raghavendra, R.
dc.contributor.author: Harborne, D.
dc.contributor.author: Alzantot, M.
dc.contributor.author: Cerutti, F.
dc.contributor.author: Srivastava, M.
dc.contributor.author: Preece, A.
dc.contributor.author: Julier, S.
dc.contributor.author: Rao, R. M.
dc.contributor.author: Kelley, T. D.
dc.contributor.author: Braines, D.
dc.contributor.author: Şensoy, Murat
dc.contributor.author: Willis, C. J.
dc.contributor.author: Gurram, P.
dc.contributor.department: Computer Science
dc.contributor.ozuauthor: ŞENSOY, Murat
dc.date.accessioned: 2020-04-30T13:27:34Z
dc.date.available: 2020-04-30T13:27:34Z
dc.date.issued: 2018-06-26
dc.description.abstract: Deep neural networks have achieved near-human accuracy levels in various types of classification and prediction tasks, including images, text, speech, and video data. However, the networks continue to be treated mostly as black-box function approximators, mapping a given input to a classification output. The next step in this human-machine evolutionary process - incorporating these networks into mission-critical processes such as medical diagnosis, planning, and control - requires a level of trust to be associated with the machine output. Typically, statistical metrics are used to quantify the uncertainty of an output. However, the notion of trust also depends on the visibility that a human has into the workings of the machine. In other words, the neural network should provide human-understandable justifications for its output, leading to insights about its inner workings. We call such models interpretable deep networks. Interpretability is not a monolithic notion. In fact, the subjectivity of an interpretation, due to different levels of human understanding, implies that there must be a multitude of dimensions that together constitute interpretability. In addition, the interpretation itself can be provided either in terms of the low-level network parameters or in terms of the input features used by the model. In this paper, we outline some of the dimensions that are useful for model interpretability and categorize prior work along those dimensions. In the process, we perform a gap analysis of what needs to be done to improve model interpretability.
dc.description.sponsorship: United States Department of Defense, US Army Research Laboratory (ARL); UK Ministry of Defence
dc.identifier.doi: 10.1109/UIC-ATC.2017.8397411
dc.identifier.endpage: 6
dc.identifier.isbn: 978-1-5386-0435-9
dc.identifier.scopus: 2-s2.0-85050232544
dc.identifier.startpage: 1
dc.identifier.uri: http://hdl.handle.net/10679/6547
dc.identifier.uri: https://doi.org/10.1109/UIC-ATC.2017.8397411
dc.identifier.wos: 000464418300020
dc.language.iso: eng
dc.publicationstatus: Published
dc.publisher: IEEE
dc.relation.ispartof: 2017 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computed, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI)
dc.relation.publicationcategory: International
dc.rights: info:eu-repo/semantics/restrictedAccess
dc.title: Interpretability of deep learning models: a survey of results
dc.type: Conference paper
dspace.entity.type: Publication
relation.isOrgUnitOfPublication: 85662e71-2a61-492a-b407-df4d38ab90d7
relation.isOrgUnitOfPublication.latestForDiscovery: 85662e71-2a61-492a-b407-df4d38ab90d7
