Diario de optilete

03 de noviembre de 2020

Overview of Computer Vision in iNaturalist

Overzicht van Computer Vision in iNaturalist

Corresponding author: Ken-ichi Ueda (kueda@inaturalist.org)
Received: 29 Sep 2020 | Published: 01 Oct 2020
Citation: Ueda K-i (2020) An Overview of Computer Vision in iNaturalist. Biodiversity Information Science and
Standards 4: e59133. https://doi.org/10.3897/biss.4.59133

Abstract
iNaturalist is a social network of people who record and share observations of biodiversity.
For several years, iNaturalist has been employing computer vision models trained on
iNaturalist data to provide automated species identification assistance to iNaturalist
participants. This presentation offers an overview of how we are using this technology, the
data and tools we used to create it, challenges we have faced in its development, and
ways we might apply it in the future.
Presenting author
Ken-ichi Ueda
Presented at
TDWG 2020

It's available here: https://www.youtube.com/watch?v=xfbabznYFV0

GEbruiker iNAturalist
https://forum.inaturalist.org/t/what-i-learned-after-training-my-own-computer-vision-model-on-inats-data/44052

Publicado el noviembre 3, 2020 07:42 TARDE por

optilete | 1 comentario | Deja un comentario

iNaturalist en het Computer Vision, een korte geschiedenis

Op dit moment is Inaturalist al weer bezig met de zesde versie van het Computer Kijk (Computer Vision) model
waarbij in September 2020 18 miljoen fotos apart gezet zijn waarme zo'n 35.000 soorten wereld wijd herkend kunnen worden.
De aanpak is het zelfde als voor model 5 alleen met veel meer fotos omdat er nu veel meer soorten in iNaturalist
2000 fotos heeft. In het verleden werden wel meer dan 2000 fotos per soort gebruikt maar de extra rekenkracht weegt niet op tegen het succes.
In totaal zal het doorrekenen van het model 210 dagen kosten en in het voorjaar van 2021 klaar zijn.
Naast het doorrekenen van hetzelfde model met meer fotos en meer soorten wordt tgelijkertijd het huidige systeem vergeleken
met "TensorFlow 2, Xception vs Inception" wat waarschijnlijk ditzelfde model niet in 210 dagen maar in 60 dagen doorrekend.
Als dit nieuwe TensorFlow 2, Xception vs Inception goed werkt dan wordt het zelfs nog winter 2021 een nieuwe model opgeleverd.
Om dit door rekenen was een nieuwe hardware besteld maar door COVID is dit nog niet geinstalleerd.
In het huidige model zijn 25.000 van de 300.000 soorten die waargenomen zijn in iNaturalist.
https://www.inaturalist.org/blog/42626-we-passed-300-000-species-observed-on-inaturalist#comments

Hoe wordt nu bepaald of een soort opgenomen wordt in het model ?
Als van een soort 100 waarnemingen met foto waarvan er minsten 50 een Research Grade community ID heeft wordt opgenomen in de training. (actually, that’s really verifiable + would-be-verifiable-if-not-captive, In het model worden ook ontsnapte en gekweekte soorten opgenomen). Voor de training wordt dus niet alleen en uitsluitend Research Grade fotos gebruikt.

Globaal waren de oude versies:
May 2017 Model 1
Aug 2017 Model 2
Jan 2018 Model 3
Feb 2019 Model 4
Sep 2019 Model 5
Mar 2020 Model 6

Training

Training Set 1

In deze groep zitten geidentificeerde met

De waarneming heeft een Taxon of een Genus, Familie toegewezen
De waarneming heeft geen flags
De waarneming heeft alle quality metrics gehaald behalve het toegestasnde wild / naturalized, dit zijn items die genoemd worden in de DQA, Quality Assesment

Validation Set 1

Met deze groep fotos wordt tijdens de training de voortgang van de training bekeken, een Toets of Examen dat het trainingmodel moet afleggen. De eisen aan deze validatieset zijn hetzelfde als van de Training Set 1 maar het is maar 5% van het aantal fotos.

TestSet 1

Met deze groep fotos wordt als de training is afgelopen gekeken of het model goed werkt. Het betreft uitsluitend
fotos met een Community taxon, dus fotos die waarschijnlijk wel goed moeten zijn omdat meerdere personen een determinatie toegeveogd hebben aan de waarneming.
Het bijzondere is dus dat aan de training ook minder zekere fotos mee mogen doen terwijl het testen tegen absoluut zekere waarnemingen gedaan wordt.
Zie ook https://forum.inaturalist.org/t/identification-quality-on-inaturalist/7507
Om te voorkomen dat er te veel soorten zijn waarvan er te weinig fotos zijn worden er niet te veel beperkingen aan de fotos gesteld. In de toekomst worden de eisen misschien strenger

Fotos van Nieuwe gebruikers
CID'd obs, waarnemingen met alleen een Computer Vision ID
vision-based ID
Gebruik geen fotos van IDs by users with X maverick IDs

Het computer is niet te downloaden maar misschien dat er later nog een API komt. Training your own with https://www.kaggle.com/c/inaturalist-challenge-at-fgvc-2017

Croppen van fotos, Volgorde, Best Photo First

Al hoewel het op iNaturalist neit vaak gezegd wordt is het Croppen van een foto een goede methode om betere resultaten te krijgen.
Het model neemt ook geografische data nog niet echt mee. In het verleden werden enorme aantallen Californische soorten voorgesteld maar in de loop van de modellen is dat wel afgenomen.

Best Photo First
Het is naast croppen erg verstandig om je beste foto het eerste neer te zetten omdat het model alleen de eerste foto van de waarneming gebruikt om een voorstel voor de soort te doen.
De locatie, nauwkeurigheid van een foto die je neemt buiten de iNat app om is meestasl minder nauwkeurig dan wanner je de interne app gebruikt van iNat. Ook kun je dan inzoomen met je vingers spread out, zodat je de crop functionaliteit niet hoeft te gebruiken. Het model gebruikt niet het tijd van het seizoen (eikels en kastanjes in de herfst, Trekvogels in voorjaar en herfst. Geen zomervogels als gierzwaluw in de winter en verspreidinggegevens van soorten.. ALpenroosjes worden niet tot de ALpen beperkt.

In 2017 the amount of recognised species was 20.000 and now it is still.....20.000?

https://www.inaturalist.org/pages/help#cv-taxa
FWIW, there's also discussion and some additional charts at https://forum.inaturalist.org/t/psst-new-vision-model-released/10854/11
https://forum.inaturalist.org/t/identification-quality-on-inaturalist/7507
https://www.pyimagesearch.com/2017/03/20/imagenet-vggnet-resnet-inception-xception-keras/
https://www.inaturalist.org/posts/31806-a-new-vision-model#activity_comment_5763380

Neural Networks (specifically, VGG16) pre-trained on the ImageNet dataset with Python and the Keras deep learning library.

The pre-trained networks inside of Keras are capable of recognizing 1,000 different object categories, similar to objects we encounter in our day-to-day lives with high accuracy.

Back then, the pre-trained ImageNet models were separate from the core Keras library, requiring us to clone a free-standing GitHub repo and then manually copy the code into our projects.

This solution worked well enough; however, since my original blog post was published, the pre-trained networks (VGG16, VGG19, ResNet50, Inception V3, and Xception) have been fully integrated into the Keras core (no need to clone down a separate repo anymore) — these implementations can be found inside the applications sub-module.

Because of this, I’ve decided to create a new, updated tutorial that demonstrates how to utilize these state-of-the-art networks in your own classification projects.

Specifically, we’ll create a special Python script that can load any of these networks using either a TensorFlow or Theano backend, and then classify your own custom input images.

To learn more about classifying images with VGGNet, ResNet, Inception, and Xception, just keep reading.

= = = = = = = = = = = = = = = = =

https://towardsdatascience.com/xception-from-scratch-using-tensorflow-even-better-than-inception-940fb231ced9

Xception: Implementing from scratch using Tensorflow
Even better than Inception
Convolutional Neural Networks (CNN) have come a long way, from the LeNet-style, AlexNet, VGG models, which used simple stacks of convolutional layers for feature extraction and max-pooling layers for spatial sub-sampling, stacked one after the other, to Inception and ResNet networks which use skip connections and multiple convolutional and max-pooling blocks in each layer. Since its introduction, one of the best networks in computer vision has been the Inception network. The Inception model uses a stack of modules, each module containing a bunch of feature extractors, which allow them to learn richer representations with fewer parameters.
Xception paper — https://arxiv.org/abs/1610.02357

= = = = = = = = = = = = = = = = = = = = =
https://towardsdatascience.com/review-xception-with-depthwise-separable-convolution-better-than-inception-v3-image-dc967dd42568
Inthis story, Xception [1] by Google, stands for Extreme version of Inception, is reviewed. With a modified depthwise separable convolution, it is even better than Inception-v3 2 for both ImageNet ILSVRC and JFT datasets. Though it is a 2017 CVPR paper which was just published last year, it’s already had more than 300 citations when I was writing this story. (Sik-Ho Tsang @ Medium)

= = = = = = = = = = = = = = = = = = = = = = = = = = = =
https://laptrinhx.com/xception-from-scratch-using-tensorflow-even-better-than-inception-212761016/
Convolutional Neural Networks (CNN) have come a long way, from the LeNet-style, AlexNet, VGG models, which used simple stacks of convolutional layers for feature extraction and max-pooling layers for spatial sub-sampling, stacked one after the other, to Inception and ResNet networks which use skip connections and multiple convolutional and max-pooling blocks in each layer. Since its introduction, one of the best networks in computer vision has been the Inception network. The Inception model uses a stack of modules, each module containing a bunch of feature extractors, which allow them to learn richer representations with fewer parameters.

Xception paper — https://arxiv.org/abs/1610.02357

Publicado el noviembre 3, 2020 09:01 TARDE por

optilete | 7 comentarios | Deja un comentario

19 de noviembre de 2020

Ancestor disagreements

https://www.inaturalist.org/posts/25514-clarifying-ancestor-disagreements#activity_comment_5800586

Publicado el noviembre 19, 2020 11:16 TARDE por

optilete | 1 observación | 0 comentarios | Deja un comentario

20 de noviembre de 2020

Nederlandse namen talrijkst aanwezig op iNaturalist na het Engels

Nederlandse (vogel)namen talrijk aanwezig op iNaturalist(32)
Hieronder een overzicht van locale landelijke namen van Reptielen, amfibieen en vogels van alle talen die aanwezig zijn op iNaturalist...(In totaal zijn er wel 400 lexicons aanwezig op iNaturalist.org). De gegevens zijn al een paar maanden oud en zullen in de loop van de tijd ongetwijfeld veranderd zijn. De gegevens zijn verzameld om te kijken hoeveel soorten er missen per taal.

Het nederlandse namen komen drie keer vaker voor als bijvoorbeeld de duitse of franse namen.
https://www.inaturalist.org/journal/ahospers/44123-nederlandse-vogel-namen-talrijk-aanwezig-op-inaturalist-32

Publicado el noviembre 20, 2020 11:10 TARDE por

optilete | 1 comentario | Deja un comentario

24 de noviembre de 2020

Nuttige Natuurgidsen vaak geschreven als Journaalpost, dagboek op iNaturalist(36)

Helpful Identification Guides
Locale Dagboekpsoten en externe sites die nuttig zijn als gids

TOPICS AUTHOR

Acanthocephalus: Spine-Headed/Leaf-footed Bugs Zootaxa article
*Agalinis: Identifying Agalinis spp. (False Foxgloves) in Texas @pfau_tarleton
Anemones: Guide to ID @pfau_tarleton
Assassin Bugs ( Zelus spp.) @pfau_tarleton
Broomrape Species @blue_celery
Broomweed: Amphiachyris dracunculoides and look-alikes @rymcdaniel
Brambles: Dewberries and Blackberries ( Rubus species of Texas) @kimberlietx
Bumblebees @pfau_tarletonr
Crotons @nathantaylor
Dandelions @nathantaylor
Draba and related genera @pfau_tarleton
Elms: American vs. Slippery @lisa281
Elms: Cedar vs. Winged @lisa281
*Elms: Chinese vs. Siberian @lisa281
Elm spring samaras: American, Slippery, and Winged Elm @lisa281
Erigeron (Fleabane and Horseweed) @lisa281
Frogfruits (Phyla) @lisa281
Galls: Texas Woolly Oak Galls @kimberlietx
Gophers vs moles (signs) @pfau_tarleton
Hawks - Cooper's vs Sharp-shinned Cornell
Leaves: Glossary of Leaves @kimberlietx
Medicago (Medicks) @nathantaylor
Mosses: Identifying Goblet Mosses @rmedina
Moths: Guide to Petrophila Moths in Texas @gcwarbler
Moth Wing Features @mamestraconfigurata
Mushrooms: Simplified Key to Major Groups of Mushrooms Michael Kuo @ MushroomExpert.Com
Mushrooms: Photographing for better ID Billy Stone, BRIT
Packera and Senecio (Groundsels) @lisa281
Privets ( Ligustrum spp.) @lisa281
Sesbania (Riverhemps)</ @lisa281
Soapberry vs. Pistache @baldeagle
Solidago (Goldenrods) @bouteloua
Spurges of the DFW area @nathantaylor
Sumac (Rhus spp.) YouTube video @conboy
Sumacs: Key to ID the Rhus spp of North America @conboy
Tetrigidae Pygmy Grasshoppers @aispinsects
Thistles: Identifying Texas Thistles Katie Stern at Perennial Ecology
Three-Banded LeafhoppersErythroneura spp. @kimberlietx
Tick Identification TickEncounter Resource Center
Trees: Identify sometimes difficult trees @lanechaffin
Trees: Key to the Broadleaf Trees of North Central Texas @lisa281
Trees: First Steps in Tree ID (You Tube Video) @lisa281
Turkey Tail and Lookalikes @sarahduhon
o.
Illustrated glossary of leaves: https://www.inaturalist.org/posts/30456-illustrated-glossary-of-leaves
Three banded Leafhoppers/Erythroneura spp et al: https://www.inaturalist.org/posts/25070-three-banded-leafhoppers-erythroneura-spp-et-al Nuttige Natuurgidsen vaak geschreven als Journaalpost, dagboek op iNaturalist(36)

Publicado el noviembre 24, 2020 11:01 TARDE por

optilete | 1 observación | 5 comentarios | Deja un comentario

28 de noviembre de 2020

Nauwkeurigheid van de locatie.

Als je de lijst van waarnemers wil hebben met onnauwkeurigheid dan
https://www.inaturalist.org/observations?acc=false&place_id=any&subview=table&view=observers

Publicado el noviembre 28, 2020 09:57 TARDE por

optilete | 3 comentarios | Deja un comentario

30 de noviembre de 2020

Werkgroep ''De bryoflora van Park Oudegein en een deel van het IJsselbos'' opgericht

De bryoflora van ParkOudegein en een deel van het IJsselbos te Nieuwegein is in 2002 onderzocht door Dick Kerkhof e.o. Hij heeft er een mooi verslag met de titel ''De bryoflora van Park Oudegein en een deel van het IJsselbos'' over geschreven dat overal op het internet te vinden is.

Op 30 november 2020 is een nieuw aan IVN gelieerde werkgroep opgericht om het Park Oudegein en omgeving te inventariserenop bladmossen, veenmossen, hauwmossen, levermossen en korstmossen. Daarnaast wordt ook de Bosschewaard, de fruitbomen in IJsselstein en het bos bij Oostbroek onderzocht.

Publicado el noviembre 30, 2020 11:03 TARDE por

optilete | 0 comentarios | Deja un comentario

Archivos de diario de noviembre 2020