Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix UNet implementation with arbitrary channel sizes (#243) #276

Open
wants to merge 10 commits into
base: master
Choose a base branch
from
23 changes: 13 additions & 10 deletions src/convnets/unet.jl
Original file line number Diff line number Diff line change
Expand Up @@ -71,18 +71,21 @@ Backbone of any Metalhead ResNet-like model can be used as encoder
- `final`: final block as described in original paper
- `fdownscale`: downscale factor
"""
function unet(encoder_backbone, imgdims, outplanes::Integer,
final::Any = unet_final_block, fdownscale::Integer = 0)
backbonelayers = collect(flatten_chains(encoder_backbone))
layers = unetlayers(backbonelayers, imgdims; m_middle = unet_middle_block,
skip_upscale = fdownscale)
function unet(encoder_backbone, imgdims, inchannels::Integer, outplanes::Integer,
final::Any = unet_final_block, fdownscale::Integer = 0)
backbonelayers = collect(flatten_chains(encoder_backbone))
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please pay attention to the formatting, you lost the indentation here


outsz = Flux.outputsize(layers, imgdims)
layers = Chain(layers, final(outsz[end - 1], outplanes))
# Adjusting input size to include channels
adjusted_imgdims = (imgdims..., inchannels, 1)

return layers
end
layers = unetlayers(backbonelayers, adjusted_imgdims; m_middle = unet_middle_block,
skip_upscale = fdownscale)

outsz = Flux.outputsize(layers, adjusted_imgdims)
layers = Chain(layers, final(outsz[end - 1], outplanes))

return layers
end
"""
UNet(imsize::Dims{2} = (256, 256), inchannels::Integer = 3, outplanes::Integer = 3,
encoder_backbone = Metalhead.backbone(DenseNet(121)); pretrain::Bool = false)
Expand Down Expand Up @@ -114,7 +117,7 @@ end

function UNet(imsize::Dims{2} = (256, 256), inchannels::Integer = 3, outplanes::Integer = 3,
encoder_backbone = Metalhead.backbone(DenseNet(121)); pretrain::Bool = false)
layers = unet(encoder_backbone, (imsize..., inchannels, 1), outplanes)
layers = unet(encoder_backbone, imsize, inchannels, outplanes)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

inchannels should somehow be passed in to the encoder backbone here. Of course, we will have to decide how to deal with this in case the user passes in a model with this initialised and also separately inchannels

model = UNet(layers)
if pretrain
artifact_name = "UNet"
Expand Down
Loading