Thanks for your efforts here, I study what you did.
For some context.
This image is part of a visualization course in 3DSMax and in this case to recreate this scene. The whole point is to be able to match an image without known information like a floorpan, focal length and so on.
The first stage was to find the vanishing point and ensure it is in the middle of the image. To do this crops are added to the image to make sure the vanishing point is in the middle via rulers in Photoshop set at 50%. It seems if the vanishing point isn't in the middle the perspective match won't work correctly.
After that we have a grid overlay where each ground plane square is half a meter. So with this an approx distance can be calculated on the floor. Lastly, we try to find something in the scene that has a known size. In this case it could be the bar height which are often around 104-107cm or so according to Google. Bar stools are also often at a standardized height as well.
Where 3DSMax makes this process easier is in the perspective match tool. Once the image is in the viewport you can simply use the tool to drag the ground plane to where it should be. From there create a cube that matches the dimensions of something in the scene that you identified earlier, in this case the bar. So say a cube 104cm and 600cm wide. Using the distance and vertical adjustments of the perspective match tool, you can line up your cube with the bar until they match. Add a camera and set its height to around 120cm and turn on Auto Correct Vertical Tilt and that's it.
As I prefer using Cinema (I'm on a Mac) I was trying to camera match it with C4D and I found, like you, the process harder to do with mixed results. The focal length in your version of the scene is 90mm which would be unlikely. I have a 19mm and 45mm Tilt/Shift for work and the 45 would be way too tight for this shot, even if the images were stitched vertically.
When using the Camera Calibrator in C4D the focal length varied a lot, anything from 35 to 300mm lol. It seemed the vertical grid lines (y) were the culprit. Adding an extra line could reduce or increase the focal length. I also found it hard if impossible to get the vertical grid lines in the calibrator to be perfectly straight and they can't be snapped it seems. I'm guessing this one of the reasons why the pitch and roll of the camera co-ordinates wouldn't be at 0 and you can't adjust them.
I've included some screenshots so you can see what I'm referring to 
https://www.icloud.com/iclouddrive/02cb5R_LhEHeotDRFQOAkTwtQ#Camera_Match_Screenshots