In order to call a Google Cloud API from your app, you need to create an intermediate REST API that handles authorization and protects secret values such as API keys. You then need to write code in your mobile app to authenticate to and communicate with this intermediate service.
One way to create this REST API is by using Firebase Authentication and Functions, which gives you a managed, serverless gateway to Google Cloud APIs that handles authentication and can be called from your mobile app with pre-built SDKs.
This guide demonstrates how to use this technique to call the Cloud Vision API from your app. This method will allow all authenticated users to access Cloud Vision billed services through your Cloud project, so consider whether this auth mechanism is sufficient for your use case before proceeding.
Before you begin
Configure your project
If you have not already added Firebase to your app, do so by following the steps in the getting started guide.Use Swift Package Manager to install and manage Firebase dependencies.
- In Xcode, with your app project open, navigate to File > Add Packages.
- When prompted, add the Firebase Apple platforms SDK repository:
- Choose the Firebase ML library.
- Add the -ObjCflag to the Other Linker Flags section of your target's build settings.
- When finished, Xcode will automatically begin resolving and downloading your dependencies in the background.
https://github.com/firebase/firebase-ios-sdk.git
Next, perform some in-app setup:
- In your app, import Firebase:
  Swiftimport FirebaseMLModelDownloader Objective-C@import FirebaseMLModelDownloader; 
A few more configuration steps, and we're ready to go:
- 
  If you haven't already enabled Cloud-based APIs for your project, do so now: - Open the Firebase ML APIs page in the Firebase console.
- 
      If you haven't already upgraded your project to the pay-as-you-go Blaze pricing plan, click Upgrade to do so. (You'll be prompted to upgrade only if your project isn't on the Blaze pricing plan.) Only projects on the Blaze pricing plan can use Cloud-based APIs. 
- If Cloud-based APIs aren't already enabled, click Enable Cloud-based APIs.
 
- Configure your existing Firebase API keys to disallow access to the Cloud
    Vision API:
    - Open the Credentials page of the Cloud console.
- For each API key in the list, open the editing view, and in the Key Restrictions section, add all of the available APIs except the Cloud Vision API to the list.
 
Deploy the callable function
Next, deploy the Cloud Function you will use to bridge your app and the Cloud
  Vision API. The functions-samples repository contains an example
  you can use.
By default, accessing the Cloud Vision API through this function will allow only authenticated users of your app access to the Cloud Vision API. You can modify the function for different requirements.
To deploy the function:
- Clone or download the functions-samples repo
    and change to the Node-1st-gen/vision-annotate-imagedirectory:git clone https://github.com/firebase/functions-samplescd Node-1st-gen/vision-annotate-image
- Install dependencies:
cd functionsnpm installcd ..
- If you don't have the Firebase CLI, install it.
- Initialize a Firebase project in the vision-annotate-imagedirectory. When prompted, select your project in the list.firebase init 
- Deploy the function: firebase deploy --only functions:annotateImage 
Add Firebase Auth to your app
The callable function deployed above will reject any request from non-authenticated users of your app. If you have not already done so, you will need to add Firebase Auth to your app.
Add necessary dependencies to your app
Use Swift Package Manager to install the Cloud Functions for Firebase library.
1. Prepare the input image
In order to call Cloud Vision, the image must be formatted as a base64-encoded string. To process aUIImage:
Swift
guard let imageData = uiImage.jpegData(compressionQuality: 1.0) else { return } let base64encodedImage = imageData.base64EncodedString()
Objective-C
NSData *imageData = UIImageJPEGRepresentation(uiImage, 1.0f); NSString *base64encodedImage = [imageData base64EncodedStringWithOptions:NSDataBase64Encoding76CharacterLineLength];
2. Invoke the callable function to recognize landmarks
To recognize landmarks in an image, invoke the callable function passing a JSON Cloud Vision request.- First, initialize an instance of Cloud Functions: - Swift- lazy var functions = Functions.functions()- Objective-C- @property(strong, nonatomic) FIRFunctions *functions;
- Create a request with Type set to - LANDMARK_DETECTION:- Swift- let requestData = [ "image": ["content": base64encodedImage], "features": ["maxResults": 5, "type": "LANDMARK_DETECTION"] ]- Objective-C- NSDictionary *requestData = @{ @"image": @{@"content": base64encodedImage}, @"features": @{@"maxResults": @5, @"type": @"LANDMARK_DETECTION"} };
- Finally, invoke the function: - Swift- do { let result = try await functions.httpsCallable("annotateImage").call(requestData) print(result) } catch { if let error = error as NSError? { if error.domain == FunctionsErrorDomain { let code = FunctionsErrorCode(rawValue: error.code) let message = error.localizedDescription let details = error.userInfo[FunctionsErrorDetailsKey] } // ... } }- Objective-C- [[_functions HTTPSCallableWithName:@"annotateImage"] callWithObject:requestData completion:^(FIRHTTPSCallableResult * _Nullable result, NSError * _Nullable error) { if (error) { if ([error.domain isEqualToString:@"com.firebase.functions"]) { FIRFunctionsErrorCode code = error.code; NSString *message = error.localizedDescription; NSObject *details = error.userInfo[@"details"]; } // ... } // Function completed succesfully // Get information about labeled objects }];
3. Get information about the recognized landmarks
If the landmark recognition operation succeeds, a JSON response of
BatchAnnotateImagesResponse
will be returned in the task's result. Each object in the landmarkAnnotations
array represents a landmark that was recognized in the image. For each landmark,
you can get its bounding coordinates in the input image, the landmark's name,
its latitude and longitude, its Knowledge Graph entity ID (if available), and
the confidence score of the match. For example:
Swift
if let labelArray = (result?.data as? [String: Any])?["landmarkAnnotations"] as? [[String:Any]] {
  for labelObj in labelArray {
    let landmarkName = labelObj["description"]
    let entityId = labelObj["mid"]
    let score = labelObj["score"]
    let bounds = labelObj["boundingPoly"]
    // Multiple locations are possible, e.g., the location of the depicted
    // landmark and the location the picture was taken.
    guard let locations = labelObj["locations"] as? [[String: [String: Any]]] else { continue }
    for location in locations {
      let latitude = location["latLng"]?["latitude"]
      let longitude = location["latLng"]?["longitude"]
    }
  }
}
Objective-C
NSArray *labelArray = result.data[@"landmarkAnnotations"];
for (NSDictionary *labelObj in labelArray) {
  NSString *landmarkName = labelObj[@"description"];
  NSString *entityId = labelObj[@"mid"];
  NSNumber *score = labelObj[@"score"];
  NSArray *bounds = labelObj[@"boundingPoly"];
  // Multiple locations are possible, e.g., the location of the depicted
  // landmark and the location the picture was taken.
  NSArray *locations = labelObj[@"locations"];
  for (NSDictionary *location in locations) {
    NSNumber *latitude = location[@"latLng"][@"latitude"];
    NSNumber *longitude = location[@"latLng"][@"longitude"];
  }
}